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Abstract. Assign i.i.d. standard exponential edge weights to the edges of the complete 
graph Kn, and let M„ be the resulting minimum spanning tree. We show that M„ 
converges in the local weak sense (also called Aldous-Steele or Benjamini-Schramm con- 
vergence), to a random infinite tree M. The tree M may be viewed as the component 
containing the root in the wired minimum spanning forest of the Poisson-weighted infinite 
tree (PWIT). We describe a Markov process construction of M starting from the invasion 
percolation cluster on the PWIT. We then show that M has cubic volume growth, up 
to lower order fluctuations for which we provide explicit bounds. Our volume growth 
estimates confirm recent predictions from the physics literature [18], and contrast with 
the behaviour of invasion percolation on the PWIT [2] and on regular trees [6], which 
exhibit quadratic volume growth. 
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1. Introduction 

Avery recent preprint, written by the author of this paper and Broutin, Goldschmidt 
and Miermont [3], identified the Gromov-Hausdorff'-Prokhorov scaling hmit of the 
minimum spanning tree of the complete graph and described some of its basic properties. 
The current paper complements the work of [3] by identifying the un-rescaled or local weak 
limit of the minimum spanning tree of the complete graph. Both the work of [3] and the 
current work open avenues for further research, into developing a more detailed under- 
standing of the properties of the limiting objects. We highlight some specific questions of 
interest later in the introduction; for the moment, we jump right to the definitions required 
for a precise statement of our results. 

Rooted weighted graphs. A rooted weighted graph (RWG) is a triple G = {G,p,w), 
where G = {V{G), E{G)) is a connected simple graph with countable vertex degrees, 
p G V{G) is the root vertex, and w : E{G) — )• [0, oo) assigns weights to the edges of G. By 
convention, we view unweighted rooted graphs with as RWGs, by letting all edges have 
weight 1. 

If G = (G, p, w) is an RWG and G' is a connected subgraph of G containing p (i.e., G' 
is a graph with p E V{G') C V{G) and E{G') C E{G)), then {G' , p,w\e{g')) is another 
RWG, and is a sub-RWG of G. We sometimes write (C, p, w) instead of (C, p, w\e[g')) for 
succinctness, when doing so is unlikely to cause confusion. Given S C V{G) with p & S, 
we write G[S] for the sub-RWG of G induced by S. 

For an RWG G = (G, p, w), the graph distance dc : V{G) x V{G) — )• [0, oo] is given by 

dQ{u,v) = inf : P a path from u to v in G} , 

for u,v € V{G). (Elsewhere we let inf = oo, but note that here the infimum is non-empty 
as G is connected.) We similarly define the weighted graph distance by 

dQ{u, v) = inf < w{e) : P a path from u to u in G 

[eeB{P) 

For V G V{G) and x > 0, we write Bq{v,x) = {u G V{G) : dQ{u,v) < x} and B'(^{v,x) = 
{u G V{G) : d'Q{u, v) < x}. We say G is locally finite if |-Bq(?', x)| < oo for all x >0. 

Prim's algorithm/invasion percolation. We briefiy recall the definition of Prim's 
algorithm (also called invasion percolation) on an RWG. Let G = (G, p, w) be a locally 
finite RWG with all edge weights distinct. 

Prim's algorithm on G. 

Let El = Ei{G) = and let vi = vi{G) = p. 

For 1 < i < \ViG)\: 

-k let Cj = ej(G) G E{G) minimize {w{e) : e = uv, u G {vi, . . . , Vi},v {vi, . . . , fj}}; 
-k let Vi+i = Vi+i{G) = V and let E'j+i = £'j+i(G) = Ei U {ej+i}. 

If G is finite then, writing n = |y(G)|, the graph (F(G),£'„) is the unique tree T = 
{V{T),E{T)) with V{T) = V{G) and E{T) C E{G) minimizing Eeei?{T) "'(e); in other 
words, it is the minimum weight spanning tree (MST) of G. If G is infinite, however, 
{vi,i > 1} may be a strict subset of V{G), in which case the tree constructed by Prim's 
algorithm is not a spanning tree; we call it the invasion percolation cluster of G. 

There is a large corpus on minimum spanning trees/invasion percolation clusters of 
random graphs, with contributions from both the combinatorics and probability commu- 
nities (we refer the interested reader to [3] for a detailed bibliography). In many settings, 
minimum spanning trees and forests turn out to be intimately linked to an associated 
percolation process. This can serve as both a motivation and a warning (to justify the 
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latter epithet, we record Newman's observation [24] that, weighting the edges of Z*^ by iid 
Uniform[0, 1] edge weights, the invasion percolation cluster has asymptotically zero density 
if and only if the percolation probability 6p^{7J^) = 0). 

Local weak limits. Our current aim is to study the local structure of the MST of the 
complete graph. More precisely, for n > 1 we write Kn for the graph with vertices 
[n] = {!,..., n} and an edge between each pair of vertices. Let Wn = {VF„(e) : e E 
E{Kn)} be iid Exponential(n — 1) edge weights, and write K„ = Then let 

M-n = (H)-£'n(K„)) be the minimum spanning tree of K„, and let M„ = (M„, 1, Wn) be 
the RWG formed from M„ by rooting at the vertex 1. The primary contributions of this 
paper are to identify the local weak limit of M„ and to prove volume growth bounds for 
the limiting RWG. 

In order to precisely state our results, we first recall the notion of local weak convergence. 
For unweighted graphs, such convergence was introduced by Benjamini and Schramm [9]. 
The formulation we use here is essentially that of Aldous and Steele [5]. For e > 0, we 
say two RWGs G = (G, p, and G' = {G',p',w') are e-isomorphic, and write G ~e G', if 
there exists a bijection (p : V{G) — )• V{G') that induces a graph isomorphism of G and G' , 
such that (^(p) = p' and such that for all edges uv E e(G), \w{uv) — w' {(j){u)(j){v))\ < e. If 
e = we say that G and G' are isomorphic and write G ~ G'. 

We may define a pseudometric (Iq on the set of locally finite RWGs as follows. Given 
RWGs Gi = {Gi^ pi^wi) and G2 = {G2, P2,W2), we let 

(io(Gi, G2) = 

where 

n = sup {A: E N : 35 E [-k-\ k-^] , G[{k) -^-i G'^ik + 5)} . 

Note that do(Gi,G2) = precisely if G ~ G'. Now let c^lwc be the push-forward of do 
to the set G* of isomorphism-equivalence classes of locally finite RWGs.^ It is straightfor- 
ward to verify that (^*,dLwc) forms a complete, separable metric space, and we refer to 
convergence in distribution in this metric space as local weak convergence, or sometimes 
simply as weak convergence. 

1.1. Statement of results. As is the case for many random combinatorial optimization 
problems on the complete graph, the local weak limit of M„ is naturally described in terms 
of the Poisson-weighted infinite tree (PWIT) of Aldous and Steele [5]. The PWIT is the 
following random RWG. Let U be the Ulam-Harris tree; this is the tree with vertex set 
Un>o (write N'' = {0}), and for each k > 1 and each vertex v = (ni, . . . , n^) E N'^, an 
edge between v and its parent (ni, . . . ,nfc_i). (If A: = 1 then the parent of v is the root 
vertex 0.) Independently for each v = (ni, . . . , n^) E V, let {wi, i > 1) be the atoms of a 
homogenous rate one Poisson process on [0, 00), and for each i > 1 give the edge from v 
to its child v' = (ni, . . . , n^, i) the weight TV({f , v'}) = Wi. Writing W = {W{e), e E E}, 
where E is the edge set of U, the Poisson-weighted infinite tree is (a random RWG with 
the distribution of) the triple U = (C/, 0, VF). 

For any vertex u E V{U), write T^") for the invasion percolation cluster of U^"-* = 
{U, u, W). The tree T^®^ will play a particularly important role, so we write T = r(0) and 
write T = {T,$,W). Temporarily let Mq be the union of all the trees {T^'^\u E V{U)); 
Mq should be thought of as the minimum spanning forest of U with wired boundary 
conditions. Next, let M be the connected component of Mq containing the root 0. In this 
paper we establish that M = (M, 0, W) is the local weak limit of the minimum spanning 
tree M„ = {Mn,hWn). 



When referring to an equivalence class [(G, p, w)] € Q* , we will typically instead write (G, p, w); this slight 
abuse of notation should never cause confusion. 
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Theorem 1.1. As n ^ oo, we have (Mn, 1, Wn) — ?■ (M, 0, W), in the local weak sense. 

Theorem 1.1 in particular extends a resuh of Aldous ([4], Theorem 1). There is a.s 
a unique edge e G E[M) incident to whose removal separates from oo (this follows 
from Corollary 7.2, below, which states that M is one-ended). Write F for the component 
containing after e is removed. Correspondingly, let be the edge of M„ whose removal 
minimizes the size of the resulting component containing 1 (with ties broken by choosing 
the component containing the smallest label, say), and write for this component. Aldous 
proved that (F„,l,py„) converges in distribution in the local weak sense, to an almost 
surely finite limit, which in our setting has the distribution of (-F, 0, W). The convergence 
of {Fn, 1) to (F, 0) follows immediately from Theorem 1.1. 

The proof of Theorem 1.1 will also provide an explicit characterization of the limit, which 
is straightforward enough to yield an exact, though somewhat complicated, description of 
the distribution of the degree of the root vertex in M. This description is provided in 
Section 2.3. 

Our second main result is to show that the limit M has roughly cubic volume growth. 
Theorem 1.2. There is C > such that almost surely 

r |i3M(0,r)| 

lim sup < oo , 

r-5.00 r^eC log ' r 



and almost surely 



limint!|M<»#>0 

r— >oo 7"J / log r 



This volume growth agrees with recent predictions from the physics literature [18], and, 
as shown by Theorem 1.3, stands in contrast to the volume growth of T = (T, 0,VF), the 
invasion percolation cluster of {U, 0, W). 

Theorem 1.3. It is almost surely the case that 

y |i?T(0,r)| 

nm sup ^5- < 00 , 

r^oo r^ log log r 

and that for any e > 0, 

. Ji?T(0,r')| 
lim mf -— = 00 . 

r— >-co / log 

In Theorems 1.2 and 1.3, we have stated our bounds in terms of the unweighted graph 
distance. However, using the explicit characterizations of M and T given in Section 2, 
it is a straightforward technical exercise to establish the same bounds for the growth of 
B'^{9,r) and 5^(0, r). 

A similar dichotomy of dimensionality, paralleling the difference in volume growth be- 
tween T and M, appears when studying the Gromov-Hausdoff-Prokhorov scaling limit of 
the minimum spanning tree; see [3] for details. Broadly speaking, the tree T, which is a 
subtree of M, determines the global metric structure of M, in the sense that for "typical" 
nodes u, v of M, the distance duiu, v) is of about the same order as duiu, v), where u and 
V are the nearest nodes in T to u and v, respectively. However, the bulk of the mass in 
M lies outside of the subtree T. Theorems 1.2 and 1.3 may be viewed as a step towards 
formalizing this picture. 

We expect that |i?M(0, '')|/'^'^ forms a tight sequence but that almost surely 

. , |i?M(0,r)| |i?M(0,r)| 
limmf ^ = U, limsup ^ = 00 . 

r^oo r-5-oo 

A proof of any of these predictions would be interesting. It would also be be interesting to 
pin down precisely the almost sure fiuctuations of |i?M(0) r)|/r^ (assuming this is the right 
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renormalization) , or even of |-Bt(0) r)\/r'^. The work of [12] on the exact Hausdorff measure 
function for the Brownian CRT heuristically suggests that the almost sure fluctuations of 
|ST(0,r)|/r^ may be of order log log r, but the connection between the two settings is 
rather tenuous. 

Theorem 1.3 should be compared with [6], Theorem 1.6. The latter shows that, writing 
Y(d) for the local weak limit of invasion percolation on an infinite d-ary tree, |-BY(d)(0) r)\/{dr'^) 
converges in distribution as r — )• oo (and explicitly describes the Laplace transform of the 
limit). Theorem 1.9 of [6] additionally shows that lim sup^.^.^^ E [r^/|i?Y(d)(0) ^)l] < oo. 
However, we do not see how to deduce our Theorem 1.3 from the results of [6]. 

We finish the description of our contributions with a suggestion for future work: it would 
be quite interesting to understand the spectral and diffusive properties of M. Results of 
Barlow et. al. [8] and of Kumagai and Mizumi [20] suggest that for simple random walk 
{xn,n > 1) on M, started from the root 0, we should have P = 0} = n~^^'^~^°^^^ and 
dM_{9,Xn) = n^/'^^°p^^\ However, the volume growth upper bound of Theorem 1.2 is not 
sharp enough to allow such results to be applied "out of the box", and neither do the 
requisite resistance bounds follow immediately from the current work. 

1.2. Proofs of the main results (a brief sketch). The upper bound in Theorem 1.3 
is straightforward. An earlier paper [2] showed that T is stochastically dominated by 
a Poisson(l) Galton- Watson tree conditioned to be infinite. This fact, together with 
existing volume growth estimates for the incipient infinite cluster on trees [7], yields the 
upper bound. For the lower bound in Theorem 1.3, we analyze an explicit description 
of the distribution of T from [2]. This description, reviewed in Section 2.1, below, states 
that T is comprised of a sequence of Poisson(l) Galton- Watson trees, always of random 
size but conditioned to be increasingly large as the sequence goes on, and glued together 
along a backbone. Roughly speaking (ignoring logarithmic corrections), this allows us to 
find a subtree of T distributed as a conditioned Poisson Galton- Watson tree with around 

vertices, and containing a backbone node of distance about r from the root. Such a 
(sub)tree also has diameter roughly r, and this yields the lower bound. 

To prove Theorems 1.1 and 1.2, we introduce, and then study, a two-step construction 
of the minimum spanning tree M^. The construction is more elegant in the n — )• oo limit, 
so we now outline the construction for M = (M, 0, W). 

We associate to each vertex v S V{M) an arrival time a{v). Recall that T = (T, 0,Ty) 
is the invasion percolation cluster of {U, 0, W); its distribution will be explicitly described 
in Section 2.1. Given a vertex v £ V{M), if w G V{T) then set a{v) = 1. If v ^ V{T) 
then write for the shortest path in U from v to T (this path lies within M), and let 
a{v) = max{W{e), e G E{Py)}. Next, for A > 1, let M(A) be the subtree of M with vertex 
set V{M{X)) = {v e V{M) : a{v) < A}. It wiU turn out that, almost surely, 1 < a{v) < oo 
for ah V V(T), so M(l) = T and M = hmA^oo M(A). We view M as being built in two 
steps; first, the tree T is constructed (via Prim's algorithm, say). Second, the remaining 
vertices of M are dynamically exposed by letting A grow from to oo. 

For A > 1 let M(A) = (M(A),0,1^). It turns out that, conditional on T = M(l), the 
stochastic process (M(A), < A < oo) is Markovian (this is not a surprise) and its transition 
kernel has an explicit and pleasing form. This picture builds upon on the description of T 
from given in Section 2.1, so we postpone the details until Section 2.2. 

We now sketch the dynamics of {M{X),X > 1). In brief, at each time A > 1, there is 
some set of inactive vertices I\ in M(A) - these vertices form a subtree of T containing 0. 
The process A > 1) is decreasing in A, it satisfies limA^i Ix = V{T), and there is some 
almost surely finite time A* at which \I\* \ = 0. 

Once a vertex v is active, subcritical Poisson Galton- Watson trees begin to attach 
themselves to v according to a particular inhomogeneous rate function which decays ex- 
ponentially quickly in A; this process happens independently for each active vertex, and 
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is responsible for the growth of M(A) as A increases. When a tree attaches to v, all its 
vertices immediately become active. We dub this a Poisson Galton-Watson aggregation 
process. 

By combining the volume growth upper bound for T from Theorem 1.3 with a direct 
analysis of the Poisson Galton-Watson aggregation process, we are able to prove the volume 
growth upper bound from Theorem 1.2. It seems likely that the lower bound should also 
be provable "in the limit", but I have been unable to achieve such an argument. For the 
lower bound, I instead resort to a somewhat involved second moment calculation, based on 
an analysis of modified versions of Prim's algorithm, started from three distinct vertices 
(it turns out to be necessary to consider two vertices aside from the root, which is why 
three distinct vertices are needed for a second moment argument). 

Finally, the proof of Theorem 1.1 proceeds by analyzing a hybrid MST algorithm. Given 
a small input parameter e > 0, the we first Prim's algorithm for a random number of steps, 
halting at a (stopping) time r that is 0(en) with high probability^. This is the finite-n 
analogue of the limiting "step one" and builds a random RWG M„(r) whose distribution 
is close to that of T for e small. Starting from M„(r), we then run Kruskal's algorithm 
to complete the construction of M„. This second phase corresponds to the limiting "step 
two". Its relatively straightforward analysis in Section 6 will yield Theorem 1.1. 

1.3. Outline. Before turning to proofs, we briefly outline the remainder of the paper. 
In Section 2, we describe a Markovian construction of the invasion percolation cluster 
T = (T, 0, W) that was established in an earlier work [2] and that will be exploited several 
times in the current paper. We additionally we describe a continuous-time process that 
builds M from T; this process is at the heart of the proof of Theorem 1.1 and is also used 
in proving Theorem 1.2. Finally, in Section 2.3 we note some distributional identities that 
may be obtained from our results. 

In Section 3 we analyze the early stages (the first o(n) steps) of Prim's algorithm on 
K„, and show that the RWG thereby constructed has T as its local weak limit. This fact 
allows us to define a finite-n growth procedure analogous to (M(A), A > 1), which is used 
in proving Theorem 1.1. 

In Section 4 we establish some useful properties of the decay of weights along the 
unique infinite path in T. These properties come into play in Section 5, in which we prove 
Theorem 1.3, and also in proving the lower bound of Theorem 1.2. 

In Section 6, we prove Theorem 1.1. As discussed in the proof sketch above, this proof 
is based on the analysis of a hybrid MST algorithm. 

In Section 7 we prove Theorem 1.2. The upper bound is based on a direct analysis of the 
process M(A), and ultimately reduces to bounding the value of a certain iterated integral. 
The lower bound, based on a second moment argument which is the most technical element 
of this work, occupies the majority of Section 7. 

Finally, we alert the reader that just before the bibliography, we have provided a list of 
some of the notation used in the paper, with brief reminders of the definitions and with 
page references. 

1.4. Acknowledgements. I started this work in July 2012, after Itai Benjamini and 
Shankar Bhamidi separately asked me what was known about the subject. I thank both 
of them for the encouragement to pursue this line of inquiry. Throughout my work on 
this project I was supported by an NSERC Discovery Grant and by an FQRNT Nouveau 
Chercheur grant. 

2. The distributions of T and of M 
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2.1. An explicit construction of T. The following description of the distribution of T 
is given by Theorem 27 of [2]. Say that an edge e G E{T) is a forward maximal edge if 
the removal of e separates from infinity and if, for any other edge e' E E{T), if the path 
from e' to contains e then W{e') < W{e). Write 5*1 = 0, and list the forward maximal 
edges of T in increasing order of distance from as {{Ri, Si+i},i > 1). The removal of all 
forward maximal edges separates T into an infinite sequence of random trees {Pi,i > 1), 
where for each i > 1, Ri and Si are vertices of Pi. 

For each i > 1, let Zi = \V{Pi)\ and let Xi = W{{Ri, Sj+i}). The sequence {{Xi, Zi), i > 
1) turns out to be a Markov process, whose distribution we now explain. 

For A > 0, write PGW(A) to denote a Galton- Watson tree with offspring distribution 
Poisson(A). We recall that for < A < 1, the random variable | PGW(A)| has the Borel- 
Tanner{X) distribution, given by 

P {I PGW(A)| = m} = — P {Poisson(Am) = m - 1} = . (2.1) 

m m [m — Ij! 

For A > 1, write ^(A) = P {| PGW(A)| = oo}; the function 9 is strictly increasing, and for 
A > 1 is infinitely differentiable and concave.'^ For A > 1 we also write B\ for a random 
variable whose distribution is a "truncated, size-biased" analogue of the Borel- Tanner 
distribution: the distribution of Bx is given by 

. e(A) e-^"^(Am)™-i , , 

^'^^-"'-(i^ (m-i). ■ 

The fact that Ylim>i — (m-T)! ~ ^^^^ preceding equation indeed defines 

a probability distribution, is proved in [2, Corollary 29]. Also, equation (4.3), below, 
explains our use of the epithets 'truncated' and 'size-biased' for the random variable B\. 

By the results of [2], ((Aj,Zj),i > 1) is a Markov process taking values in (l,oo) x N, 
with infinitesimal generator k given by 

K[[x,q,{y,m))- ^^^^ (m-1)! [^<"] ' 
In other words, for 1 < y < x < oo and 1 < ^, m < oo we have 

P {Xn+i G dy, Zn+i = m (A„, Z„) = [x,£)} = —— — — . 

v[x) [m — Ij! 

Equivalently (see [2], Lemma 28 and Corollary 29), we have 

P{A„+i edy\Xn = x} = ^^M^ , (2.3) 

y(x) 

and, conditional on A„+i, the random variable Zn+i is distributed as Bx^^^ and is (con- 
ditionally) independent of {{Xi,Zi),i < n). Furthermore, Xi = 9^^{U), where U is 
Uniform[0, 1], and, conditional on Xi, the random variable Zi is distributed as Bxi- To- 
gether with the above generator, this initial distribution specifies the distribution of the 
whole process {{Xi, Zi),i > 1). We remark that for each i > 1, the distribution of Aj+i 
given that Xi = x is the same as the distribution of Xi given that Xi < x. 

Conditional on the sequence {Zi,i > 1), independently for each i > 1, Pj is distributed 
as a uniformly random labelled tree with Zi vertices. (Equivalently, Pi is distributed as 
PGW(A) conditioned to have Zi vertices - this conditional distribution does not depend 
on A > 0; see, e.g., [29], Exercise 5.15.) Finally, for each i > 1, conditional on Pi, the 
vertices Ri and Si are independent, uniformly random elements of V{Pi), and the weights 
{l^(e) : e G E{Pi)} are independent and uniform on [0, Aj] (recall that W{{Ri, Si+i}) = 
Xi). 



The concavity of 9 can be verified using the implicit formula 9{X) 
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The process {Xi,i > 1) is sometimes dubbed the forward maximal process (for invasion 
percolation on the PWIT in [2] and for invasion percolation on regular trees in [6]). In this 
paper, we instead use this term for the process {{Xi, Zi), i > 1) as a matter of convenience. 

2.2. The Poisson Galton- Watson aggregation process. Recall the description of the 
random RWGs (M(A),A > 1) from Section 1.2. The aim of this section is to explicitly 
describe the dynamics of the process (M(A), A > 1). 

The "percolation probability" 9{x) was defined in Section 2.1. Given c > 1, we also 
define the Poisson Galton- Watson "dual parameter" c* , which is the unique value c' < 1 
for which ce~^ = c'e~^ . (Another identity for c* which we will use later is that c* = 
c(l — 9{c)).) It is straightforward to verify that for c > 1, if T has distribution PGW(c) 
then the conditional distribution of T, given that T is finite, is PGW(c*). We use that c* 
decreases as c increases, which is also straightforward to check. 

For any node v £ V{U), v 7^ 0, write p{v) for the parent of v. Then temporarily write C/„ 
for the subtree of U rooted at v and containing only edges of weight less than W{{p{v),v}). 
Note that for c > 1, conditional on W{{p{v), v}) = c, the tree Uy has PGW(c) distribution. 
If we additionally condition Uv to be finite, then it has distribution PGW(c*). 

For each vertex v £ V{M), there is a unique infinite path P within M leaving v. Write 
x{v) = sup{H^(e) : e £ P}. (If v G V{U) \ V{M) then set x{v) = 00.) Uv£ V{T) 
then, in the terminology of the preceding section, we have x(v) = Xi for some i > 1. If 
V V{T) then consider the unique edge e = {u, u'} of P with u £ V(T), u' V{T). Since 
T is built by invasion percolation (and is locally finite), there is an infinite path within T 
leaving u and with all edges of weight less than x{v). In this case the (finite) portion of 
P connecting v and u' contains a unique edge e' with a{v) = W{e'), and we must then 
have x{v) = W{e') = a{v). Note that the collection of values {x{u),u £ V{M{X))) is 
measurable with respect to M(A). 

Now fix A > 1 and a vertex u £ V{U). U u £ V{M{X)) then let e„ = {u,u'} be the 
smallest weight edge of U incident to u that is not contained in M(A). Necessarily, u' is a 
child of u in U. If u G V{T) then it is possible that x{u) > A. In this case all edges from 
u leaving T have weight at least x{u) > A. However, if u ^ ^(^) then x{u) = a{u) < A. 
Whether or not u £ V(T), given that u £ V{M{X)) and x(u) < A, we have that W{eu) 
has distribution A + Exponential(l). Furthermore, given that u £ V{M{X)) and x{u) < A, 
the edge e is an edge of M precisely if < 00. 

By the above, we have that for small 6 > 0, 

P {W{e) £{X,X + d],e£ E{M{X + 6]),u£ V{M{X)),x{u) < A) | M(A)} 
= (1 + O54o(l))(l - e~'^)P {| PGW(A)| < 00} • l[ueViAm)Mu)<x] 

= (1 + O<54.o(l))'^(l - GW) ■ '^[ueV{M{X)),x{u)<X] , 

and 

P{W{e) £ {X,X + 6],e£ E{M{X + 6]),u£ V{M{X)),x{u) > X \ M(A)} 

= 05;o(l)l[uGy(Af(A)),x(«)>A] • 

Furthermore, given that W{e) £ (A, A + 6] and that e £ E{M{X + 6]), the tree C/„ 
stochastically dominates PGW((A + 6)*) and is stochastically dominated by PGW(A*), 
and a{w) £ (A, A + 6] for all vertices w £ V{Uy) (in particular for w = v). Finally, as the 
edge weights on vertex-disjoint subtrees of U are independent, the events 

{{W{eu) G (A, A + 5]} n {eu £ E{M{X + 5))}, u £ V{M{X))) 

are conditionally independent given M(A), and the subtrees {Uu',u £ V{M{X))) are like- 
wise conditionally independent. 
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The preceding information yields the following description of the process (M(A), A > 1). 
At each time A > 1, there is a set of active vertices (those vertices v G V{U) with x{v) < A). 
Once a vertex v is active, Poisson Galton- Watson trees begin to attach themselves at v; 
at time A, attachments occur at rate (1 — ^(A)). This process happens independently for 
each active vertex. A tree attaching to v at time s is distributed as PGW(s*). When a 
tree attaches to v, all its vertices immediately become active. 

We now add the edge weights to the above picture. The edge weights for M(l) = T 
are already defined. For v S V{U), v ^ 9, the weights of edges in the subtree of U rooted 
at V are independent Exponential(l) random variables, independent of all edge weights 
ouside this subtree. It follows that given that given that x{v) = A (i.e. that v £ V{M{X)) 
and V M(A— )), and given Uv, the weights of edges in Uv are independent, with density 
fx{x) = (1 — e~^) • e~^l[o<a;<A]) and these edge weights are conditionally independent of 
all other weights of edges in M(A). In other words, if a tree T attaches to vertex u at 
time A then the edge e from the root of T to n has weight exactly A, and edges of T have 
independent weights, each with density f\{x). 

Based on the above description, we call the RWG- valued process (M(A), 1 < A < oo) 
a Poisson Galton-Watson aggregation process. Note that we may recover M as M(oo) = 
limA^ooM(A). For a vertex v G V{M), ii v ^ V{M{1)) = ViT) then the value x{v) is 
the time at which v was added during the Poisson Galton-Watson aggregation process, so 
V S V{M{x{v))) but V V{M{x{v)—)). Furthermore, ii v ^ ^(T) then since a random 
tree attached at time x{v) has distribution PGW{x{v)*), the degree deg^(j.(^))(f ) has 
distribution Poisson(x(u)*) + 1, where the 1 accounts for the parent of v. It follows that 
for any v £ V{U), given that v £ V{M), conditional on T and on x{v), the degree deg]^^{v) 
of V in M has distribution 



is almost surely finite; from this it is easily seen that M is almost surely locally finite. 
Also, it will follow from our arguments for the upper bound in Theorem 1.2 that M is 
almost surely one-ended (see Corollary 7.2, below; there should be a simple direct proof 
of this fact, however). 

We will eventually prove Theorem 1.1 by showing that a hybrid construction process 
for Mji, defined in Section 6, converges in distribution to the Poisson Galton-Watson 
aggregation process. 

2.3. A few consequences of Theorem 1.1. We now briefly remark on some conse- 
quences of our weak convergence result and of the above descriptions of T and M. The 
characterization given by Theorem 1.1 is straightforward enough to yield an explicit, 
though somewhat complicated, description of the distribution of the degree of the root 
vertex 0. First, as in Section 2.1, let U be Uniform[0, 1], let Xi = e-^{U), and given the 
value of Xi, let Zi be distributed as Bxi, whose distribution is given in (2.2). 

For integer m > 1, let Um be the distribution of the degree of node 1 in a uniformly 
random labelled tree with nodes {1, . . . , m}; this distribution is explicitly given by 



Let Di have distribution vz^{k)] in other words, given that Zi = m, Di has distribution 
Vrn- Also, let D2 have distribution Bernoulli (l/.Z^i), with D2 conditionally independent 



degriv) + Poisson(/^^^)(l - 9it))dt) ii v £ ViT) 
1 + Poisson(a;(t;) + OJl - eit))dt) \iv^ V(T). 



Since /^°°(1 — ^(t)) < 00, it follows that for all v £ V(T\ the conditional expectation 

E[degM(^^) I v£V{M),x(v),T\ 
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of Di given {Xi, Z\). The description of T given in Section 2.1 implies that degji(0) has 
distribution D\ + D2- 

Finahy, the description of the Poisson Galton- Watson aggregation process impHes that 
given (Xi, Zi), degjv/(0) — degj'(0) is independent of detT(0) and is Poisson(J^^(l — 0(t))dt) 
distributed. Letting Dy, be Poisson (1 — 0(t))dt and be conditionaUy independent of D\ 
and of D2 given (Xi, Zi), we then have the fohowing corollary. 



Corollary 2.1. The random variable deg 



is distributed as Di + D2 + -D3. 



It is interesting to contrast this result with Proposition 2 from [4], which expresses 
deg^(0) as a mixture of Poisson random variables: 



P{degM(0) = ^ + 



Jo 



P {Poisson($(ti)) = i} du . 



where for < u < 1 we set (^(li) = /J* log(l/x)/(l — x)dx. 

We close this section with a final observation. By a classic result of Frieze [16] we have 
that 



E 



2C(3) . 



ee£;(Af):0ee 

Combining this fact with Theorem 1.1 and with Theorem 3 of [23], it follows that for 
U = Uniform[0,l], we have 



E 



-HU) 



t{l - 9{t))dt 



2C(3) - C(2) 



which is perhaps not obvious. 



3. Future maxima in K„, and a strengthening of Proposition 3.1 

The paper [2] shows that the local weak limit of the early stages of Prim's algorithm 
on K„ is given by T = (r,0,W).^ More precisely, for 1 < /c < n let M„^fe be the 
subtree of M„ built by the first k steps of invasion percolation on K„, so Mn^k has vertices 
1 = fi(K„), . . . ,ffc(K„) and edges ei(K„), . . . ,efc_i(K„). Then write M^^k = (M^.fc, l,Wn)- 
Similarly, for A; > 1, write Tk for the tree built by the first k steps of Prim's algorithm on 
U = ([/, 0,iy), so Tk has vertices = vi(U), . . . ,Ufc(U) and edges ei(U), . . . ,efc_i(U), and 
let Tk = iTk,(l>,W). 

Proposition 3.1 ([2]). Fix any function k{n) : N — N such that k{n) — 00 and k{n) = 
o(n}/'^). Then for each n we may couple y^n,k(n) ^'^^ T so that 

P {M„,fe(n) /Tfe(„)} ^0 

as n — )• 00. In particular, yin,k{n) — > T in the local weak sense. 

The RWG T is almost surely locally finite (this is proved in [2] but is also easy to see 
directly), and so necessarily T^ ^ T in the local weak sense, as A; — >■ 00. Using this fact, 
the first assertion of the proposition immediately yields the distributional convergence 
claimed in the proposition. We will in fact require an analogue of Proposition 3.1 that 
holds as long as k[n) = o{n), and now prove such a result. 



See the remark just after Theorem 27 of [2], together with the remark in Section 1.1 of the same paper. 
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Proposition 3.2. Fix / : N — )• N with f{n) — )• oo and f{n) = o(n). Then we may couple 
Mn,/(n) o-i^d T so that 



Jim sup {r : M„j(„) [Sm„,^(„) (r)] = T^(„)[BT^,„j(r)]} 



a.s. 

= OO. 



In particular, M„(/(n)) A T as n ^ oo. 

The proof of Proposition 3.2 will introduce, in simplified form, some of the important 
structures and techniques that will be developed later in the paper. 

We begin by considering Prim's algorithm on the PWIT. Given v S V{U) and z > 0, 
write Uy{z) for the subtree of U consisting of v and all descendents of v whose path to v 
contains only edge of weight less than z. (In the notation of Section 2.2, for v 7^ 0, we 
in particular have Uv{W{{p{v),v})) = U^.) This tree is is distributed as PGW(A) and is 
therefore infinite with probability 9{z). 

Given 2; > 1, let 

g{z) = max{i : W{ei{\J)) > z] , 

or g{z) = if W{ei{\J)) < z for all i > 1. Next, list the indices i for which ei(U) > z as 
(ii, . . . where J = J{z,\J) > is random. Note that for each j > 1, if J > j then 
^Vi.+i{i])i^) is distributed as PGW(2;), and so 

P{J = j\J>j} = 9{z). 

It follows that for ah j > 0, P{J = j} = (1 - e{z)y9{z), and EJ = l/9{z). Also note 
that for j > 0, if J > j then U^^^^j^iz) is distributed as PGW(2;) conditioned to be finite, 
or in other words is distributed as PGW(z*), and so 

E -i,\J>j]=E [\ PGW(z*)|] = 

(the latter equality, easily proved, is re-stated in (4.1) below). Since g{z) = ij whenever 
g{z) > 0, it follows that 

We next turn to Prim's algorithm on K„. Using the same definition of g{z) does not 
turn out to be suitable in this case: with high probability the largest weight edge is the 
last edge added by Prim's algorithm, in which case we would have g{z) = n — 1 for all 
z for which there are edges of weight greater than z in the minimum spanning tree. We 
instead look at the largest edge added before some 'threshold time' k; this is formalized 
in the following definition. 

For z > let K^^ be the subgraph of with edges E{K^) = {e G E{Kn) : Wn{e) < z}. 
Given 1 < j < n — 1 and z > 0, let 

gn{j, z) = sup{e :l<e<j, Wn{ee{Kn)) > z} , (3.2) 

In words, (7n(i, z) is the last time up to step j that Prim's algorithm on Kn adds an edge 
of weight at least z. We wish to bound the upper tail of gn{fn, z), for suitable values of 
m and z. To do so, list the connected components of as = {C^i, . . . ,C^^), in 
decreasing order of size (with ties broken lexicographically, say). Consider the behaviour 
of Prim's algorithm conditional on the sequence C^. Each time Prim's algorithm adds an 
edge connecting to some component C^j, it fully connects C^^ before adding any edge 
leaving j. Furthermore, once ^ is fully connected, the edge leaving ^ connects ^ 
with a uniformly random vertex among all those not yet uncovered. Writing an = crn{z) 
for the number of components uncovered before an edge to is added, for each j > 
we therefore have 

P{cTn=j I an>j,C:,} > 

n 
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We next use two bounds for the sizes of the components of ; the first is due to Stepanov 
[25] (pages 64-65), and the second can be found in [27], Proposition 4.12.^ For any e > 
there is 5 > such that for all n sufficiently large, 

P{|C^,i-n0(z)| >en} <e-'^", ^ {\Cl^^\ >\og^ n] < n'^"^^ . (3.3) 

It follows from the first bound that there is (5 > such that for all i > 1, 

PK>j}<(l-0(^)/2)^+e-^". 

However, for m > (T„ • 2!; if il ^ then necessarily gn{m, z) < an ■ 2!- It then 
follows from the above bounds that if log^ n < m < n{6{z) — e), we have 

P{5n(m,z) > log^n} < P{|C^,2l > log^ n} + P > log^ n} + P < m} 

< n-i°° + (1 - ^(z)/2)'°s'" + 2e-^'' 

< n-^^ , (3.4) 

for n sufficiently large. With these bounds under our belt, we are prepared to prove 
Proposition 3.2. 




Figure 1. The trees T^^^); T^, and Tg*(A:,2) are circled with solid, dashed, 
and dotted lines, respectively. Vertices of T^(2) not in T^, are drawn as solid 
black dots, and vertices of T{k) not in T{g* (k, z)) are drawn as solid grey 
dots. Note that the latter form a subtree of rooted at the far endpoint 

of <ig*(k,z)- 

Proof of Proposition 3. 2. The definitions of the current paragraph are for the most part 
illustrated in Figure 1. For integer fc > 1, let g*(k,z) = g{k,z,Tk), so g*{k,z) is the last 
time Prim's algorithm on U, started from 0, uncovers an edge of weight at least z before 
time k. We always have g*{k,z) < g{z) and so Tg*(j.^z) is a subtree of Tg(^z)- Furthermore, 
if 7^ -^g(z) then g[z) > k and so is a subtree of In the latter case, the 

vertices of T^, not in form a subtree of T^, whose path to the root passes through 

e{g*{k,z)). On the other hand, vertices of ^^(2) not in may in principle attach to any 
point of Tg(2). 

Since Tg*(k,z) '~^g{z) implies that g{z) > k, by (3.1) and Markov's inequality we have 

P {T,.(fc,2) / T,(,)} < j^^^^j^^^ . (3.5) 

Our next aim is to prove an analogous result in the finite setting. Recall the definition 
of gn{z,k) from (3.2). Fix functions f{n) and k{n) with log^ n < k{n) < f{n), with 



There is probably an earlier reference for this rather straightforward result, but I have not managed to 
track one down. 
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k{n) = o{y/n), and with /(n) = o(n). The quantities gn{k{n), z) and Qnifin), z) will play 
the roles of g*{k,z) and g{z), respectively. 

For fixed z > 1, if Mn^g^(^k{n),z) ^n,gn{f(n),z)j then Prim's algorithm adds an edge of 
weight at least z at some step i with k{n) < i < f{n). The latter implies that gn{f{n), z) > 
k{n), and since k{n) > log^ n, by (3.4) we then have 

P {Mn,g„(k{n),z) + M„,^„(^(„),,) } < P {<7„(/(n), z) > k{n)\ < , (3.6) 

for n large enough. 

By Proposition 3.1, we may couple M„ and T^,(„) so that 

Also, whenever Mn^k{n) = Tfc(„), we have g*{k{n),z) = gn{k{n),z) and so T{g* {k{n), z)) = 
^n,gn{k{n),z), and it follows that 

P {T^g*{k{n),z) / ^n,gn{k{n),z)} . (3.7) 

Combining (3.5), (3.6), and (3.7), we obtain that 

P {T,(.) / M„,,„(;(„),,)} < + -L + o(l), 

which tends to zero as n — )• oo, for any fixed z > 1. Finally, fix an integer r > 1. If 
sup{i : Xi > z} > r then B'r^{%,r) C V{Tg(^z))i and it follows that 

P {B'^{^,r) C y(r3(,))} > 1 - P {sup{i : X, > 4 < r} . 

By the expression (2.3) for the conditional densities of the Xi and the fact that 9'{1 + e) = 
2 + o(e) as e I 0, it easily follows that 

P {sup{i ■.Xi>l + 3"''} < r} ^ 

as r — oo. It follows that for any e > 0, for all r sufficiently large, writing z = 1 + \/W 
we have P {sup{i : > z} < r} < e, and for such r we obtain that 

P {M^j(n){r) / T^(„)(r)} < P {B'^[%,r) ^ y(r,(,))} + P {T,(,) / M„,,„(^(„),,)} 

= e + o„^oo(l) • 

As e > was arbitrary, this completes the proof. □ 

4. Properties of the forward maximal process {{Xn, Zn),n > 1) 
4.1. The tails of the random variables {By,y > 1}. 

For < A < 1, the first two moments of |PGW(A)| are given by the following simple 
formulas [11]: 

E[|PGW(A)|] = ^, E[|PGW(A)|2] =^^^. (4.1) 

Also, from (2.1) and Stirling's formula, it follows that 

P {| PGW(A)| = W - • Jt, • (Ae(-^))-- , 

as m — 7- oo, uniformly in < A < 1. Using explicit error bounds for Stirling's approxima- 
tion, it is not hard to see that in fact, for all m > 1 and < A < 1, 

1 . (^g(i-A))m-i < p II pGW(A)| =m}< ■ (Ae(^-^))"^-^ . 



3j^3/2 m^/'^ 
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We leave the detailed verification of these inequalities to the reader. Using that Ae^ ^ < 
g_(i_A)2 all < A < 1, and that Ae^"^ > e-2(i-^)' for A sufficiently close to 1, it 
follows for all m > 1 and for all < A < 1, 

P{|PGW(A)| =m} < ^ •e-"^^!-^)'. (4.2) 

Next recall (from Section 2.2) the definition of the dual parameter u*: for u > 1, u* is 
the unique value A G (0, 1) with ue~'^ = \e~^. It is straightforward to check that PGW(u), 
conditional on the event that | PGW(«)| < oo, has the same distribution as PGW(u*). It 
is also easily checked that (1 + e)* = 1 — e + o(e) as e | 0. 

Fix y > 1 and let By be as in (2.2). We next state probability bounds for the upper 
and lower tails of By when y is near 1. 

Lemma 4.1. There is yo > 1 such that for all 1 < y < yo and any constants < c < 1 < 
C, we have 

Proof. Writing Cy = 6{y)y* /y9' (y), straightforward calculation shows that we may rewrite 
the probability P {By = k} as 

P {By = k} = -^§/f-r -k ' 



yO'{y) k\ 

= cyk-P{\PGW{y*)\ = k} . (4.3) 

Since PGW(y*) is distributed as PGW(y) conditioned to be finite, (4.3) may explain why 
we referred to the law of By as a truncated, size-biased version of the law of PGW(y). 

The above asymptotics for 0{y), 9'{y) and y* near y = 1 yield that Cj,~y — 1 asy|l. 
The above upper bound for the tails of the random variables | PGW(A)| then implies that 
for all y > 1 and > 1, 

^{By = k]<^^e~'^--y'^' 

Using (4.3) and the y i 1 asymptotic for Cy, we obtain that for y sufficiently close to 1 and 
any < c < 1, 

«<c/(i/-l)^ 

<7c,(Lc/(y-l)2j)V2 

< Sc^/^ . (4.4) 

Similarly, assuming yo is chosen small enough that (1 — yo)/(yo — 1) > 1/2, we have that 
for all 1 < y < yo and for all C > 1, 

' J i>C/{y-iy ^ ' 

<7c,(LC/(y-l)2j)V2^^^-^ 

< 20C^/2e^^/2 < 20e-^/* . (4.5) 



□ 
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4.2. The growth and decay of {Xn, n > 1) and of (Z^, n > 1). In this section we state 
three lemmas that will be used in Recall from Section 2.1 that for n > 1, the conditional 
density of Xn+i given that X„ = x is given by f{y) = 9'{y)/6{x) for y G (l,x), where 
6{X) = P{| PGW(A)| = oo}. In other words, under this conditioning, Xn+i is distributed 
as Xi conditioned to satisfy Xi < x. The function 9 satisfies 

e'{l + e) = 2 + o(e), 

as e I 0, and so 6{l + e) = 2e + o(e). It follows that for large n, the ratios — 1)/(X„ — 

1) are approximately distributed as Uniform[0, 1] random variables, and so typically the 
difference — 1) should decrease by a factor two after increasing n by a constant amount. 
The next lemma bounds the probability of seeing "halving times" that are substantially 
longer. 

For z > 1, let I{z) = mm{i : Xi < z}. Let xi > 1 be such that 9'{xi) = 1; since the 
function 9 is concave, such xi is unique. For e > 0, write 

= I{xi) V max{A; : 3n, 1 + e < Xn+k < xi,Xn+k - 1 > {Xn+i - l)/2} . 

In words, is the greatest number of steps required for the difference (X„ — 1) to fall 
below xi, or to reduce by a factor of two once below xi, before Xn drops below 1 + e. The 
next lemma provides probability bounds on the upper tail of H^. 

Lemma 4.2. There is an absoute constant ci > such that for all e > and k > 1, we 
have 

P{/fe >fc} <log2(2/e)-e-'=i^ 
Proof. First, for each i > 1, we have 

i i 

P {/(xi) > i} < J] sup P {Xi+i > 2\Xi = x} <YlP{Xi> xi} < (1 - 9{xi)y. 
j=i">"i i=i 

Now fix e > and fc > 1. By our choice of ci we may assume that e is small, and in 
particular that e < xi — 1. Note that since 9 is concave, 9{xi) > (xi — l)^'(xi) = xi — 1 
and so necessarily xi < 1. It follows that \{ > k then either /(xi) > k/2 or else for 
some 1 < i < log2(l/e) we have 

#{i : 9{xi)/2'+^ < Xj < 9{xi)/2'} > k/2 . (4.6) 

Temporarily write jo for the first j for which Xj — 1 < 9{xi)/2^. If (4.6) is to hold then 
we must in particular have Xj^^i^i — 1 > (Xj^^g — l)/2 for each 1 < i < k/2. By the 
Markov property the probability of the event in (4.6) is therefore at most 

( sup P {Xj+i - 1 > x/2 I - 1 = x} 

\0<a;<2:i-l 

Since x < xi and ^'(xi) = 1, by convexity, for all 1 < x' < x we have 1 < 9'{x) < 2. Since 
the conditional density of Xj+i at x' given that Xj = x is proportional to 9'{x'), it follows 
that P {Xj+i - 1 > x/2 I Xi - 1 = x} < 2/3, and so 

P {#{i : 0{x,)/2'^+' < X, < 0(xi)/2*} > k/2} < (2/3)^/^ . 

By a union bound it follows that 

P{H, >k}<{l- 0(xi))^-/2 + log2(l/6)(2/3)'=/2, 

and takng 5 small enough that e~^^'^ > max(l — 9{xi), 2/3) completes the proof. □ 

Later, we will also need the following tail bound on the total number of vertices in trees 
whose forward maximal weight is above a given threshold. 
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Lemma 4.3. There exist constants C2, Ci > such that for all r > 1 and x > 1, 

Zi>xrA < Ci log r • g-^^^'^' . 

J:Xi>l+l/r J 

Proof. By adjusting the value of Ci we may assume that x is at least j > 12 log2(0(a;i) -r). 
In the proof of Lemma 4.2 we showed that P {I{xi) > < (1 — 9{xi))'^ and that 

inf P{Xj+i<l + x/2\Xj = l + x}>l/3. 

0<x<xi—l 

Writing io = /(I + 1/r), it follows from the above results that for i > 1, 

P{io > j} < P{/(xi) > j72}+P{Bin(ri/2l,l/3) < log^iOix,) ■ r)} < {l-eixi)y/^+e-^/^\ 

the last inequality holding for j > 121og2(0(xi) • r). It follows that for such j, 

P i Yl > ^^"^^ > < P {^0 > j} + jP {^i+i/r > xr^} 
[l<t<io J 

< (1 - + e"^/^® + 20ie-^/® , 

the last inequality holding by the upper bound in Lemma 4.1. Taking j = x then proves 
the lemma. □ 

Finally, the following lemma, which establishes bounds on the lower tail of (iT(0,-Rio) 
and of Xig, will be used in proving the upper bound from Theorem 1.2. In its proof, we 
will use the following explicit formula. Let 7~^"^ be a uniformly random tree with vertices 
{1, . . . , n}, and let f i, f2 be independent, uniformly random elements of {1, . . . , n}. Then 
for each l<A;<n — Iwe have 

k 

P{drin){vi,V2) >k} = r\ . (4.7) 

Lemma 4.4. There exists C2 > such that for all r > 1 with 1 + 1/r < xi, and all x > 0, 
writing io = /(I + 1/r), we have 

P {dT(0, Rio) < xr or Xi„ < x/r} < C2X^I^ . 

Proof. First, given zq, the density of at li G (1, 1 + 1/r) is ^'(ii)/6'(l + l/r). Since ^'(u) G 
[1, 2] for ah u € (1, 1+1/r), it follows that for all < e < 1 we have P {Xi^ < 1 + e/r} < 2e. 
Next, by the lower bound in Lemma 4.1 we immediately have 

P{Zig < xV^} < 8x. 

On the other hand, {Pig, Rig, Sjq) is distributed as a uniformly random tree, together with 
two independent, uniformly random vertices, conditional on its size Zjp. By (4.7) we thus 
have 



> x^r'^,dp^^^{Rif^,Sio) < cxrj < 1 - 



[cxr] 2 2 

X r — J 



/ [cxr] 

< 1-exp -2 i/(xV) 



< 1 - e-'^' 
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the second inequaUty holding as long as c < 1/4, say. For c > sufficiently small, by 
taking x = c^, it follows from these bounds that 

p[dp^^{R^„Si,)<c'r]<9c\ 

Since (iT(0,-Rio) > dp-^^{Rig, Si^), the result follows. □ 

5. Volume growth in T: a proof of Theorem 1.3 

In the preceding section, Lemma 4.3 proved upper tail bounds for the total size of the 
" forward maximal clusters" added by invasion percolation before a given forward maximal 
edge. In order to prove Theorem 1.3, we need a similar bound for the total diameter of 
such clusters. We ffist prove the requisite bound, then turn to the proof of Theorem 1.3. 

5.1. The diameters of the trees {Pi,i > 1). The subtrees {Pi,i > 1) of T were defined 
in Section 2.1. For i > 1, the tree Pj is distributed as a uniformly random labelled tree 
with Zi vertices. A variety of authors [1, 15, 22, 26] have studied the tail behavior of 
the diameter of uniformly random trees; we will use the following uniform sub-Gaussian 
estimate from [1]. Given a finite graph G, write diam(G) for the diameter of G. 

Theorem 5.1. There exist absolute constants c^jG^ > such that for all n > 1, if T^"^^ 
is a uniformly random tree on labelled vertices {1, . . . , n} then for all x > 0, 

P |diam(r(")) > xV^} < G^e-^--^^'. 

The random variable Zi is distributed as Bz^, so typically has size of order [Zi — 1)^^, 
and the tree Pi should therefore have diameter of order [Zi — 1)~^. The next proposition 
essentially states that the sum of the diameters of the trees Pi, . . . , Pj is unlikely to be 
much larger than the diameter of the final tree Pi . 

Proposition 5.2. There exist constants C4, C5 > such that for all x > 1 and all r > 1, 
P I diam(Pi) > xr I < C5 logr • e'"^'''^^ . 

[i:Xi>l+l/r J 

Proof. Let e = 1/r, and recall the definition of the random variable from Section 4.2. 
By Lemma 4.2, for j > 1 we have 

P {H, > j} < log2(2/e)e-'=^^^ = log2(2r)e-'=i^^ . 

Set k = [log2(6'(j;i)/e)] , where xi is as in Lemma 4.2. For < k < /c[log2(0(xi)/e)J let 
ij = 1(1 + e(xi)/2^), and let £^ = 1(1 + e). 

Note that for for k < k, \i < j then Ik — (-k-i < j- Since Zi is distributed as Bxi, 
it follows from this observation, a union bound, and the upper bound in Lemma 4.1 that 
for 1 < k < k, for all c > 1 and i > 1, 

P {max{Z, : 4-1 < « < 4} > c • 2'^/e(xif, H, < j} 
<j sup p\B^>c-2^''/6(xif} 

<j sup P {B^> c/(x -if} 

6»(xi)/2'=<x<6»(zi)/2'=-i 

<20je~"/s . 

Writing c = c' • 2^~^, we have c • 2^^^ /0(x{f < Ac' ■ 2"(^"'^V^, and so 

P {max{Z, : 4-1 < i < 4} > 4c' • 2'^''-''^'' , < j} < 20ie-^'/8g-2(^-'=)/8 
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Write Ek for the event whose probabihty is bounded in the preceding inequahty. On Ek, 
each of the trees Pi, for (.^^i < i < ik has size at most s := [4c' • 2~('^~^)r^J. Letting T^^^ 
be a uniformly random labeled tree with s vertices, by Theorem 5.1 it follows that 

P {max{diam(Pi) : 4-i < i < 4} > 2c' • 2-(^-^)/V, Ek} 

<j ■ P |diam(r(")) > < Cde'^''-''' . 

On the other hand, if Hf, < j and max{diam(Pj) : 4-i < ^ < 4} < 2c' • 2^(^^'^')/^r for 
each k < k then 

diam(Pi) = ^ diam(Pi) < ^ _^2-i/2 ^ ^ ^•^'^''^ ' 

It follows that 

P <! ^ diam(Pi) > 8c jr \ 

[i:Xi>l+l/r J 

< P{H,>j}+ Yl P{max{Zi:4-i<i<4}>4c'-2-(^-^)r2,//e<i} 

i<fc<fc 

+ p|max{diam(Pi) : 4-1 < « < 4} > 2c'- 2-(^-'=)/V,^fc} 

l<A:<fc 

< log2(2r)e-"i^' + 20je-^'/^ ^ e'^Vs + Qje-^^'"' 

j>i 

< log2(2r)e-'=i^' + 40je~"'/^ + Cde-"'''"' . 

Taking j = f3c' for some small /3 > completes the proof. □ 



5.2. The lower bound from Theorem 1.3. For the remainder of Section 5, for r > 1 
we write iq = io{r) = /(I + 1/r). The key to the lower bound is the following proposition, 
which gives stretched exponential bounds for the lower tail of \BT'{9,r)\. 

Proposition 5.3. There exist constants C5, Cg > such that for all r > 1 and all x > 1, 

P{|PT(0,r)| <r^/x} < C76logr-e-'=5^'^'. 

Proof. Fix k > 1 and let ii > io = /(I + l/r) be minimal so that > r'^/k. By the lower 
tail bound in Lemma 4.1 we have 

P{k>io+j}<i8/k'/y , 

for all j > 0. Furthermore, by Theorem 5.1 and a union bound, for x > 1 we have 

plk<io+j, Y diam(Pi) > xjr/k^'^ I < j • Qe-'^^x^ ^ 

io<i<ii J 

A result of Luczak and Winkler [21] implies that uniformly random rooted labelled trees 
are stochastically increasing. In other words, given 1 < m < n, it is possible to construct 
a pair {tm, tn) such that tm and t„ are uniformly random labelled trees on {1, ... , m} and 
on {1, . . . ,n}, respectively, and such that is a subtree of This fact implies that, 
writing s = [r^/A;], we may find a subtree T^'') of Pj^ so that {T^^\Si^) is distributed as 
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a uniformly random rooted tree with s vertices. It follows that for all x > 1, 
P [\Bp^^{Si,,xr/k^/^)\ < r^k} <P [\BT{Si,,xr/k^/^)\ < r^k} 

= P|diam(r(*)) > xr/A;i/2| 

Finally, if i?T(0, 3xr) < r'^/k, then either J2i:Xi>i+i/r diam(Pj) > xr, or X]io<i<ii diam(Pj) > 
xr, or \Bp^^{Sij^, xr)\ < r'^/k. By Proposition 5.2 and the preceding bounds (the first two 
applied with j = k^^"^), we then have 

^) + (A:'/' + 1)^46-'=^"'. 

Taking k = x"^ yields that there exist constants c, C > such that 

P{BT(0,3xr) >rVx2} < C log r • 6"'=^''" , 
which completes the proof (take r' = xr so that rVx^ = {r'f/x^). □ 

We conclude the section by proving the lower bound from Theorem 1.3. 
Theorem 5.4. For any e > 0, we have 

pjlimmf If^'t" >l| = l. 

r— >oo 1 log r J 

Proof. Fix any non-decreasing function x : (l,c«) — )• (l,c«). If \Bi{(ll,r)\ < r'^/x{r) for 
arbitrarily large r, then we must also have that also have that |5t(0,2*)| < 4r^/x(2'') for 
infinitely many i. It follows that 

P{|ST(0,r)| < r^/x (r) i.o.j < P { |St(0> 2*) I < 4 • 2^Yx(2*) i.o.} . 

By Proposition 5.3 we have 

^P{|St(0,2^)| <422Vx(2*)} <^C6log(2*)- 6-^5^(2^1/8 _ 

i>l i>l 

Taking x{r) = log^^*^ r, the latter sum converges, and the result follows by Borel-Cantelli. 

□ 

5.3. The upper bound from Theorem 1.3. Let (TnC)0) be the PGW(l) incipient 
infinite cluster, with root 0. In other words, this is a PGW(l) Galton- Watson tree with 
root 0, conditioned to have infinite size (the existence of such a law was shown by Grimmett 
[17], and was later extended to non-Poisson branching distributions by Kesten [19]). We 
shall use Theorem 3 from [2], which provides an explicit coupling showing that (T, 0) is 
stochastically dominated by (TnC)0)- Iii other words, we may work in a space in which 
(TnC)0) is almost surely a rooted subtree of (T, 0). 

Next, for > 1, let {T^^q , p'') be a Galton- Watson tree with Binomial(A:, l/k) branching 
distribution and root p'^, conditioned to be infinite. From the fact that the Binomial(A;, l/k) 
law converges in total variation to the Poisson(l) law, it is easily seen that (Tjj(-,,p'^) 
converges in the local weak sense to (TnC)0) as fc — )• oo. We may therefore work in a 
space in which (Tjj(^,r*^) ^ (Tnci0)) or in other words, for all r G N there is an almost 
surely finite kr such that for all k > kr, 

Tllc(r)=^T^lc(r). 
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From this fact, together with the stochastic domination of T by Tnc, it foUows that for 
any r > and m > 0, we have 

P{|BT((0,r),0)| >m} <P{|i?T„c(0,OI > W 

= hm p{|i?T. (/,r)| >m| . 

We now use a bound of Barlow and Kumagai [7] (Proposition 2.7), which states that there 
exist constants cq, ci such that for all /c G N and all A > 0, 

v{\B^.Jp\r)\>\r'}<c,e'^^\ 

In fact, in [7] the bound is not asserted to be uniform in k but this is easily verified to be 
a consequence of the proof. It follows that for all r > and A > 

P{|5T(0,r-)| > Ar^} <coe-'=i^. (5.1) 

We conclude Section 5 by proving the upper bound from Theorem 1.3. 

Theorem 5.5. There exists C > such that for any e > 0, we have 

V I^T(0,r)| 

hm sup -IT- — < C > = 1 . 

r^oo log log r J 

Proof. Since for r large and r < s < 2r we have log log r < log logs < Sr^loglogr, it 
suffices to prove that there exists C > such that 

P{|5t(0,2^)| > C71ogi-22* i.o} =0. 

Taking C = 2/ci, by (5.1) we have 

J];p{|St(0,2*)| >C\ogi-2'^'] < CO- J];e-^i(^'°s*) < coY,^'^ < ^ 

i>l i>l i>l 

and the result follows by Borel-Cantelli. □ 

6. Proof of Theorem 1.1 

Recall from Section 3 that for 1 < A; < n, M„^fc is the subtree of built by the first k 
steps of Prim's algorithm on K^, started from vertex vi{Kn) = 1- 

Let k{n) = [log^ n] . In what follows we always assume n is large enough that k[n) < n. 
By Proposition 3.2 and Skorohod's representation theorem, we may work in a space in 
which 

lim sup{z G N : M„ i = TJ =• oo , (6.1) 

n— >oo ' 

and do so for the remainder of the proof. 

In this section we will write both Vi = fi(K„) E V{bAri,i) and vi = Vj(U) E y(Tj), and 
likewise write both = ej(K„) and Cj = ej(U), when there is little risk of ambiguity. By 
the comments of the preceding paragraph, at least for fixed i this is not a major abuse of 
notation. 

Next, recall the definition of gn{j^ z) from (3.2) and, for 1 < j < n — 1 and z > 1, let 

dn{j, z) = inf{£ : j < £ < n - 1, Wn{et) > z}, 

or set dn{j, z) = n if the preceding infimum is empty. In what follows we write dn{z) = 
dn{k{n),z) and gn{z) = gn{k{n),z) for succinctness. 

For z > and for 1 < j < n, let Tnij, z) be the cr-algebra induced by {M„_j, I < i < j} 
and by the indicator l[^y^(e )>z]- (This leads to a sort of filtration that is commonly en- 
countered in probabilistic combinatorics. Informally, -F„(j, z) takes us "part way through" 
step j + 1 of Prim's algorithm: we reveal whether ej has weight greater than z, but leave 
the discovery of e^-'s endpoints and precise weight for later.) Note that while dn{z) is 
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random, it is a stopping time for the filtration {Tn{j, z),l < j < n} and so Fn{dn{z), z) is 
a (T-algebra - see [28], A 14.1. Also, M-n^^^i^^) is measurable with respect to Fn{dn{z) ■, z) . 

We now run Kruskal's algorithm starting from the graph consisting of M^^^^^) together 
with the MSTs of the components of disjoint from Mn^dniz)- More precisely, for A > z, 
let Fn^ be the subgraph of M„ with vertices {1, . . . , n and edges 

{e e E{Mn) : e E S(M„,rf„(,)) or Wn{e) < A}. 

We define Fn^ similarly, but with the requirement that Wn{e) < A. For v V {Mn^dn(z)) 
we let Xn{v) = inf{A : v G F^''**}. Write 

GniX) = aiKlO< t<X) = (T(Tyn(e)l[M/„(e)<A],e G E{Kn)) 

for the cj-algebra containing all information about the graph process (K^,0 < t < A), and 
likewise define ^^(A— ). We then have that Fn and Fn ~ are measurable with respect to 
Fn,z,x = a{Fn{dn{z),z) U ^„(A)) and Fn,z,\- = a{Fn{dn{z), z) Ut/„(A-)), respectively. 

Let Mn^ be the subtree of Fn^ consisting of all nodes in the same component of Fn^ 
as 1 = vi whose path to vi in Fn^ contains no node Vj with gn{z) < j < dn{z), and let 
M.^''^ be the associated random RWG. Next, recall the definition of M(A) from Section 2.2. 
For 1 < i < g{z) let M^'''' be the subtree of M(A) consisting of all nodes whose path to the 
root = of M(A) contains no node Vj with j > g{z), and let M^'''* be the corresponding 
random RWG. (Likewise define M^'"^ ,M^'^~ ,yin^ , and M^''*'~ in the obvious ways). In 
what follows we write and for Mn°° and M^'°°, respectively. 

Lemma 6.1. For any z > 1 we have limA-s.oo hmsup„_^oo P |Mn'^ 7^ = 0. 

Lemma 6.2. For any fixed X> z we have A M^'''* as n —t- 00. 

Assuming the two lemmas, the proof of Theorem 1.1 is easily completed. By the defini- 
tion of g{z), the edge eg(^) = {p(t'g(2)+i), t'g(2)+i} is almost surely the last edge of weight 
at least z added by invasion percolation on U. It follows that eg(z) is on the unique infinite 
path from the root in T, and that for all i > g{z) + 1, Vi is a descendant of Vg(^z)+i- 
Furthermore, T is locally finite and g{z) — )• 00 as z 4- 1- It follows that for any fixed 
r E (1, 00). We thus have 

By (6.1), it follows that 



hmP{d'T(0,t;g(,)) >r} = 1. 



limliininfP 5'n(2)) >r| = 1. 



24.1 

From these facts, it follows that 

limP|5(.(0,r) C y(r^)| = 1 and limliminf P {BL (1, r) C y(M^)| = 1 . (6.2) 

24.1 ^ 241 n-s>oo " ■' 

Finally, it was observed in Section 2.2 that M = M(oo) is almost surely locally finite, and 
by Corollary 7.2, below, we have that M is almost surely one-ended. It follows that is 
almost surely finite, and so for any fixed z > 1 we have 



lim p|m^'^ / m4 = 0, 



A— >oo 

which combined with Lemmas 6.1 and 6.2 yields that A M^. Together with (6.2), this 
implies that 

M'„(r) A M'(r) 

(recall from the introduction that for an RWG G, we write G'(r) for the sub- RWG induced 
by the set of nodes at weighted distance at most r from the root). Since r was arbitrary. 
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this proves Theorem 1.1. We now turn to the proofs of Lemmas 6.1 and 6.2. In proving 
both lemmas, we wiU use the following definition. For z > 1 and v G V{Kn) write 



Xniv,z) 



max{Wn{ej),i < j < if v = Vi,i < dn{z) 

max{Wn(ej), < j < i} iiv = Vi,i> dn{z) 



We also recall from Section 2.2 that for v G V{M), x{v) is the largest weight of any edge 
in the unique infinite path in M starting from v, and that x{v) = a{v) for v V{T). 

Note that for any A > z and any v G V{Kn), the random variable z)l^Xn(v,z)<\] is 
-?>i,2,A-nieasurable. Note also that for j < gn{z) we have max{W„(ej),z < j < dn{z)} = 
max{Wn{ej) , i < j < gni^)} by the definitions of gn{z) and of dn{z). Also, since g{z) 
is almost surely finite, by (6.1) we have gn{z) g{z) and so almost surely, for all n 
sufficiently large, we have gn{z) = g{z) and Xn{vi,z) = x{vi) for all 1 < i < gn{z)- 
Furthermore, for gn{z) <i < dn{z), necessarily Xn{vi,z) < z. 

Proof of Lemma 6.1. For z < \ < oo, note that the component of Fn'^~ containing 1 = vi 
is precisely Mn^d„{\)- Indeed, by the definition of (i„(A), the vertices of M^^d^iX) 
precisely those vertices of M„ joined to Mn^d„{z) by a path all of whose edges have weight 
less than A. These are precisely the vertices joined to M^^d^iz) by Kruskal's algorithm 
started from Fn^ and stopped at weight A—. 

Next, for z < A < oo, suppose that C is a component of Fn^~ disjoint from Mn^d„{x) 
and that C is joined to M^d^ix) at time A, by some edge {v,w} with v G V{Mn^d„{x)) 
and w G V{C). By the symmetry of the model, v is equally likely to be any vertex 
V G V{Mn^^^(^x)) with Xn{v,z) < A (and can not be any vertex v with Xn{v,z) > A). But 
almost surely 

{v G F(M„,rf4;,)) : xn{v, z)>X} = {v„ l<i< 5„(A)} C {v„ l<i< gn{z)} = V{M^'') . 

Since V{Mn^) C V{Mn'^~), this implies that for any X > z, the end point in 1/(M„_^^(;^)) 
of a new connection at time A is uniformly distributed over 

{vi,gn{X) < i < dn{t)} D V{M^^d„^x)) \ V{M^'^-) . 
Since also |y(M^'°°)| = n, this immediately yields that for all z < A < oo. 

Next, since g{z) is a.s. finite and gn{z) ^ g{z) in the space where (6.1) holds, it follows 
that for all e > there is A^^ > such that for n large, 

^{gn{z) > iVj < e/3. 

Now fix e > and < a < 6'(z)/2 smah enough that 1/(1 - a) < 1 + e^/{3N^). Next, 
for any A > 1 write A^^x = {\dn{X) —n6{X)\ < an\}. Reprising the argument for (3.4), for 
n large enough, if A^^x fails to occur then either |Cn^2(A)| > log^n or J„(A) > log^ n or 
\Cn,iW — n9{X)\ > an/3, so for any fixed A > 1, for n large, 



P{^n,A}< 



n 



-99 
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For any e < a, combining bounds from the last three displayed equations, we obtain 
that 



P{\V{M^n\>m/e'} 



e2 



< P{gn{z) > iVj + P{A^,J + — E [\V{M:^n\MA„,.,g^i.)<N,]] 

e 1 A^, 

< — \ 1 • n ■ 

3 3N, 9{z) - a 

< e, 

for n large. The penultimate inequality follows from (6.3) applied with X = z and the 
tower law (since Mn^ = Mn^g^^z) by definition). The final inequality holds since 9{z) —a> 
e{z)/2>a>e. 

Finally, by our choice of a, and since 0{\) — )• 1 as A — >■ oo, we may choose \ > z 
sufficiently large that l/(6l(A) - a) < 1 + e'/{?,N'^). By (6.3) we have 



E{|y(M„^'-)| - |y(M^-'^)| I < im,?'") 

On An^x n {\V{Mn'^)\ < 3Nje^} we have 



n 



dn{X) 



and so 

P {m^'- / M^} = P{\V{M^nn\ - l^(M^'')l > l} 

< P {Al^x] + P {|^(M„^'-)| > mje^] + 6 

< 3e, 

for n large. As e > was arbitrary this completes the proof. □ 

We now proceed to the proof of Lemma 6.2. It would be possible to prove the lemma via 
an appeal to general theory (e.g. Theorem 4.2.5 of Ethier and Kurtz [14]), but verifying 
the relevant conditions is no simpler than providing a bare-hands proof, so we prefer the 
latter. 

Proof of Lemma 6.2. Fix z > 1. By (6.1) and the comments just before the proof of 
Lemma 6.1, we may work in a space in which almost surely, for n sufficiently large, we 
have gniz) = g{z), Mn^g^^(z)T'g(z), and Xn{vi,z) = x{vi) for ah i < gn{z). We work in such 
a space throughout the proof. 

We begin by considering the case \ = z. The forest Fn^ is just the tree M^^d-aiz) 
together with the components of disjoint from M^^i^^f^^y Since VF„(eg^(^)) > z and 
Wnici) < z for gniz) < i < dn{z), none of Vg^(^z)+2, ■ ■ ■ ,Vd„(z) are incident to any of 
ui, . . . ,Wg„(2). It follows that almost surely M^'^ = M.n,gn(z)- Similarly, for i < g{z), the 
activation time x{vi) is at least z and so almost surely M^'^ = T(g{z)). It follows that 
almost surely M^'^ = M^'^ for n large. 

Now let Xq = z, and for j >0 let 



A 



inf |vF(e) ■.e = {u,y},ue V{M''^^),y ^ V{M''^^)^ . 



The preceding infimum is almost surely finite and attained by a unique edge, which we 
denote fj+i = {uj+i,yj+i}, labelled so that Uj+i e V{M^''^^),yi+i V{M^''^^). Likewise, 
for n G N let An^o = z, and for j > let 

A„j+i = inf {M^(e) : e = {n,y},n E V{Mn'^'),y ^ F(M^'^0} , 
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Z \ Z \ 

and let /nj+i attain the infimum and have endpoints u,ij+i G V{Mn ^),iin,j+i V{Mn ^ 
We will show that for any fixed non- negative integer j, it is possible to couple M^'''^ 

\ z X ■ 

and M^' ^ so that almost surely, for all n sufficiently large, A„j = Xj, and M„' "'^ and 
y[z,^j gj.Q isomorphic as RWGs. Since U is almost surely locally finite, Xj — )• oo almost 
surely as j — )• oo, so such a coupling immediately yields the claimed result. 

For J = 0, we have already established the claim. Now fix j > for which the claim 

z X \ 

holds, and work in a space in which A„,j = Xj and M„' "'^ = M^' ^ for n large (we gloss the 

z X \ 

fact that Mn and M^-^^ are isomorphic rather than identical, for ease of exposition). 
Note that in such a space, we also have x{v) = Xn{v, z) for all v G y(M^''^^). 

Conditional on Xj, on M^'^^* and on {x{v),v G V{M^'^^)), let {Ey,v £ ^(M^'^j)) be 
independent Exponential(l) random variables, and for each v G y(M^'^j) let = 
max(x(w),Aj) + Ey. By the definition of the process (M(A),A > 1), under this con- 
ditioning, Aj+i is distributed as min{£^^ : v G V{M^''^^)}. Furthermore, additionally 
conditioning on Aj+i, we have the following properties: 

(i) the endpoint Uj+i of /j^-i within M^''^^ is uniformly distributed among those v G 
y(M^'^0 with x{v) < Xj+i; 

(ii) the subtree of M^'^^^+i that attaches at time Aj+i (i.e., containing the vertices 
V{M''^^+^) \ V{M''^i)) is PGW(Aj+i)-distributed; 

(iii) we have M^''*'j+i = M^'^^ precisely if the subtree from (ii) is finite, which occurs 
with probability 1 — 6{Xj+i); and 

(iv) the edge weights of the subtree from (ii) are independent exponentials conditioned 
to have value at most Aj+i. 

z X z X ■ 

We next work conditional on A„j, on M;' "'^ and on {xrr{v,z),v G V{Mn' Un- 
der such conditioning, independently for each v G V{Mn^"'^), the smallest weight edge 
incident to v leaving M^'"^"'^ has weight distributed as 



max(xn(v, z), Xj) + Exponential 



n — 1 



n-\V{Mn'^"-')\^ 



z X ■ \ 

Now, almost surely M„' "'^ = M^'^ for n large, and the latter is almost surely finite, since 

for any fixed c > 0, Exponential((n — l)/{n — c)) -^Exponential 1), it follows that we may 
couple so that almost surely Xnj+i = Aj+i for n sufficiently large. Furthermore, under 
the current conditioning, the end point Unj+i of fn,j+i is uniformly distributed among 
those V G V{Mn' with x{v) < An J, and it follows from (i) above that for n large we may 
couple so that Unj+i = Uj+i. 

Conditional on Un,j+i, the second endpoint yn,j+i of fn,j+i is uniformly distributed over 
the set 

V{Kn) \ (y(M^'^"'0} U {yn,^ < i < j, Un,^ = n„,,+l}) . 

This set has size between n—\V(Mn'^"'^) \ — 1 and n — V{Mn'^"'^). Furthermore, we have 

yn,j+i {vi,i < dn(An,j+i)}, or in other words, precisely if 
yn,j+i is not joined by Prim's algorithm before time dn{Xn,j+i)- To bound this probability, 
fix any a > 0, and define the event ^ra,Aj+i as in the proof of Lemma 6.1. Since Aj+i is 
almost surely finite, for n sufficiently large we have P|A„^Aj+i} < n~^^. Furthermore, 
since almost surely Aj+i = Xn,j+i for n large, conditional on An^\_^^-^ , almost surely for all 
n sufficiently large we have 

T af\ AO/ n-dniXn,j+i) n - dn{Xn,j+l) , s 

1 - 0{Xn,j+i) -2a < < — < 1 - d{Xn,j+i) + 2a . 

n-V{Mn' "'') n-V{Mn' -1 
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Since a > is was arbitrary, it follows by (iii) that we may couple so that almost surely, 
for n sufficiently large, Mn'^"'^+' = Mn'^"'^' if and only if M^-^i+i = M^^Xy 

Finally, given that y^.j+i ^ {vi,i< d„(A„j)}, the vertices in y(M^'^"'^+') \ ViMn'^"'') 
are precisely those of the component of Kn"'^~^^ containing i/nj+i- For a > sufficiently 
small, conditional on An^x , since A„j4.i(l — Xnj+i) < 1, the restriction of Kn^'^^ to 
the complement of {vi,i < dn{Xn,j)} forms a subcritical random graph. It is then stan- 
dard that the component containing Unj+i asymptotically dominates a PGW(Anj-i-i(l — 
6'(A„j-|_i) — 2a)) and is asymptotically dominated by a PGW(An,j+i(l — 6{Xnj+i) + 2a)). 
Since X'^j^i = A„j+i(l — 6{Xnj+i)), it follows from (ii) that we may couple so that al- 
most surely M^'^^+i = Mn^"'^^^ for n large. Finally, by the definition of K„ and of the 
trees M^'"^, the edge weights of the new subtree in M^'"^"'^^^ are independent exponentials 
conditioned to have value at most A^j+i, which together with (iv) immediately allows us 

to extend the coupling to M^'-^i+'^ and M^'^"'-'"^^ . This completes the proof. □ 

7. Volume growth in M: a proof of Theorem 1.2 

7.1. The upper bound from Theorem 1.2. Recall that by our construction of M 
from T, each vertex u G V{M) has a start time x{u), which is the largest weight on the 
unique infinite path in M leaving u. The removal of all edges of T separates M into a 
forest containing infinitely many trees. Each such tree is naturally rooted at some vertex 
V G V{T): we denote this tree Aiv, and write A''^ = |y(A^^)| for its size. Also, for u > 1 
we write A^^(^') for the subtree of M.^ induced by those nodes w with x^w) < i^, and write 
Ny[v) for the size of this subtree. In particular, we have A^t,(c«) = My. 
Now, given v > X> 1 and an integer A; > 1, write 

f -r^ 1 — OiXi) 

nk{X,u)= 11 1 _ * OLXi...dxk, 

J \<Xl<...<Xk<U j^^^ ^ 

and set no(A, v) = 1. 

Proposition 7.1. Fix v E V{U) and A > 1. Then for any v E [A, oo] we have 
E{iV,(z.) I vGV{T),x(v) = X} = Y,nk{X,v). 

Before proceeding to the proof, we note the following corollary. 
Corollary 7.2. M is almost surely one-ended. 
Proof. Applying the proposition with = oo we have 

B{N^ \ ve V{T),x{v) = A} = ^nfc(A,oo), 

A:>0 

which is finite by Proposition 7.3, below. Since T is a subtree of U, and the latter has 
countably many nodes, it follows that A^^ is almost surely finite for all v E V{T). Since T 
is one-ended, the corollary follows. □ 

Proof of Proposition 7.1. Given u E V{M), for k >0 we say that u has level A; in M if on 
the shortest path from n to T there are k + 1 distinct activation times. In other words, 
level zero nodes are nodes of T, level one nodes belong to trees that attach directly to T 
in the Poisson Galton- Watson aggregation process, and so on. We write Ai^{iy) for the 
nodes in Aiv{i^) with level k and write N^{v) for the number of such nodes. We claim 
that for all u E V{U) and all A; > we have 



E 



{ N^{v) I V E y(r), x{v) = X]= nfe(A, v) , (7.1) 
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from which the Proposition immediately follows. The case A; = of (7.1) is trival. By 
the definition of M, the arrival times of connections to v form a Poisson process with rate 
(1 — 0{t)). Furthermore, when a tree attaches at time t, it has distribution PGW(t*) and 
so its expected size is 1/(1 — t*). It follows that given that v G V{T) and x{v) = A, 

E{iVi(i.) I V G V{T),x{v) = >^]= [ ^1^^ , 

which handles the case k = 1. Next, fix A: > 1 and a node w G V{U). Again by the 
definition of M, for any Xk € (A, v), we have 

E I {u e (i^) I : w an ancestor of u} w e Ml{y),x{w) = Xk^ = j — j— dt 

= ni{xk, u) . 



By induction, the conditional density of nodes in A^J;(z^) with x{u) = x, given that v S 



V{T) and x{v) = A, is 4-nk{X, x). We thus have 



b\^N^+\i^)^ V gV{T),x{v) = Xj = r 

J A 



ni{x,v) ■ -^nk{X, x)dx = nk+i{X,u) 



where the final equality follows from the definition of nk-\-i{X,i'). This proves (7.1) by 
induction and so proves the proposition. □ 

We next bound the growth of nfc(A,oo). Notice that since x| = Xi{l — 9{xi)) we may 
re-express nk{X,u) as 

k 

f x* 
nk{X,u)= Yl—7Y±-^ dxi...dxk. 

J X<Xl<...<x^:<Iy -^^ ■'-ly^ -^i) 

Since x^e"^'* = x*e~^i we may again re-express nfc(A,z^), as 

nk{X,v)= 11 • 

J \<xx<...<Xk<v \^ -^i) 

Proposition 7.3. There exist constants c,C>0 such that for all X > 1 with A — 1 
sufficiently small, 

C ^ gC(log(l/(A-l)))i/2 

(A-l)log(l/(A-l)) ^ ^ — ■ 

Proof. First, for any fixed > A, we may rewrite the sum under consideration as 

?^fc(A, v) ■ ^ ni{iy, oo), 
fc>0 i>0 

which will be useful in what follows. We begin by proving an upper bound. Since x* 
decreases as x increases, for > 1 we have 

k 

nfc(z^,oo)<-- / TTe-^^'-'^^*) dxi...dxfc. 

- k\{l- v* f 
Since no(z^, oo) = 1 for all v, we thus have 

k>0 \ ^ J 
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Next, recall that (1 + e)* = 1 - e + ©(e^) as e i 0. It follows that as I 1 , we have 

k 



n 



X<xi<...<Xk<v 



-2(x,-l)+Oiix,-iy ) _ 



1 



^A<xi<...<a;fe<i/ 

k 



1 + 0((X,-1)2) 



dxi . . . dxk 



dxi . . . dxi 



{i + o{u-i)y 
ki 



In 



A- 1 



where the constant implicit in the notation 0(i/ — 1) may be chosen uniformly over v E 
(1, i^o) for any fixed > 1, and uniformly in k and in A G (1, i^)- We thus have 

^nfc(A,zy) = exp( (l + 0(zy-l))ln ^ 

k>0 ^ 



A- 1 



A- 1 



Combined with (7.2), we then obtain that for fixed z^o > 1; for a-ny 1 < X < v < i/q, 



nfc(A, oo) < exp 

fc>0 

^nfe(A,oo) > 



z^- 1 



fc>0 



l-u* I \X-l 



A - 1 



For given A > 1 with A > 1 small, we may optimize the lower bound (up to constants) by 
taking (z^ — 1) = (log(l/(A — 1)))^"*^. A straightforward calculation then yields that there 
is c > such that for all A > 1 small enough, 

^nA,.(A,oo) > 

A;>0 



(A-l)log(l/(A-l)) 



The upper bound is optimized by taking (u — 1) of order ((log(l/(A — 1))) ^/^), which 
then yields that there is C > such that for all A > 1 small enough. 



nk{\, oo) < e' 



C(log(l/(A-l)))V2 



A:>0 



A-1 



This completes the proof. 



□ 



We conclude the section by proving the upper bound from Theorem 1.3. In the proof we 
exploit the description of T from Section 2.1, and invite the reader to recall the relevant 
definitions. Recall also that for z > 1 we write I{z) = min{i : Xi < z}. 

Given r > 1 write Ej. for the event that either Xj(^i^i/r) < l+l/(r log^ r) or dT{^, ^/(i+i/^)) < 
r/log^r or ^i-Xi>i+i/{r\og^ r) > r^log^r. By Lemmas 4.3 and 4.4, for all r sufficiently 
large we have we have 



P{Er} < 



C2 



log4/3 , 



+ Ci log(r log^ r) • e 



-C2 log r 



< 



2C2 

log4/3 ^ 



By Borel-Cantelli it follows that, writing writing L = sup{j : occurs}, we have that L 
is almost surely finite. 

Given a node v G V{T) if v G V{Pi) then x{v) = Xi. It follows that 



E 



^ I Pi,Xi 



y^^nk{Xi,oo) 



k>0 
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Also, the Xi are decreasing, and nfc(A, oo) is decreasing in A, from this we obtain 



E 



i:Xi>l+l/(r log2 r) v<iV{P^) 



{{X„P,),j>l) 

\v{Pi)\-J2MXi 



,00 



k>0 



E 

i:Xi>l+l/{r logV) 

< ^nfc(l + l/(rlog2r),oo)- ^ Zi , 

i:Xi>l/{r\og^r) 



(7.3) 



fc>0 



where in the final inequality werecall that Zi = \V{Pi)\. 

For fixed j > 1, if j > L then by the definition of the event we have -Bm(0) 2-'/j^) C 
U:X,>i+i/(i22J) ^(^0, and Ei:X,>i+i/(riog2r) ^ f"^^^' Applying (7.3), it then follows 
that 

E[|5M(0,2Vi2)| |^->^] </22^-^nfc(l + l/(i22^-)),oo) 



fc>0 



3Clogi/2(j22^) 

l/j22i 



(7.4) 



the second-to-last inequality by the upper bound in Proposition 7.3, and the last inequality 
by a suitable choice of C . 

Finally, if lim supj,_j.oo \BMi9,'i~)\/{r^e^'~"'^°^^^^^') > 1 then for infinitely many j G N, we 
must have \BM{^,2^/f)\ > ■ e^C'io^^l-^iv H') _ ^j^^ ^^^^^ ^^^^^ ^^^y ^^^^ 

by (7.4) and the conditional Markov inequality we have 

P {|SM(0,2Vj2)| > (l?lff ■ e2C'logi/2(2Vi2) for infinitely many j G iv} 



< P{L>£} + 

< P {L > + ^ e 



^p{|SM(0,2V/)|>(2Vj 



2^,3 . g2C"logi/2(2^7j2) 



} 



-C'logi/2(2Vi2) 



and since L is almost surely finite and the sum is convergent, the latter can be made 
arbitrarily small by choosing I large. It follows that 



P <^ limsup|5M(0,r)|/(He 



3„3C" logi/2rN 



> 1 



which establishes the upper bound from Theorem 1.2. 

It is tempting to try to establish a lower bound in a similar manner, using the lower 
bound from Proposition 7.3. However, this proposition only provides information about 
the expected size of the subtrees A^^. For our volume growth upper bound we have used 
total size of each subtree, but for a lower bound information about volume growth within 
these subtrees would be required. 

In the following subsection, we state, without proof, a proposition by which volume 
growth lower bounds for T can be used to obtain corresponding lower bounds for M. This 
proposition, is then immediately used to prove the lower bound from Theorem 1.2; the 
proof of the proposition then occupies the remainder of the paper. 
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7.2. A key proposition, relating volume growth bounds for M and for T. Recall 
the definitions of M„(/c) and of gn{j, z) from Section 3, and of dn{j, z) and oi Tn{j, z) from 
Section 6. 

Fix z > 1, and let k = k{n) satisfy k{n) > log^n and k{n) = o{n). In what follows, we 
will write d = dn{k{n), z) and g = gnik{n), z) for succinctness. Note that while d = dn{k, z) 
is random, it is a stopping time for the filtration {J>i(j, z),l < j < n} and so Tn{d, z) is 
a cT-algebra - see [28], A 14.1. 

The key to our lower bound is the following estimate. Let Hn = Hn{k{n),z) be the 
forest obtained from Mn by removing the edges of Mn{d), so Hn has edges e^, . . . , Cn-i- 
Note that this forest consists of d connected components (trees) , which we view as rooted 
at vi,...,Vd- For 1 < j < d, we write Unj = Un,j{k{n), z) for the vertex set of the 
component of Hn rooted at vj , and for r > 1 write U^j for the set of vertices of Un,j whose 
distance to vj (in Hn) is at most r. 

Proposition 7.4. There is an absolute constant M > 1 such that the following holds. For 
all z > 1 with z — 1 sufficiently small, for any random subset S of {vj,g < j < d} that is 
J- n{d, z)- measurable, and any A> 1, we have 

p (e >-^}>-i^{isi^^}- 0.(1)) (i - ^ -«(.)) . 

Before proving this proposition, we use it to complete the proof of the lower bound from 
Theorem 1.2 

7.2.1. The lower bound from Theorem 1.2. Fix r > 1 and let z = z{r) = 1 + (log"'^'^ r)/r. 
With k{n) as above, continue to write g = gn{k{n), z) and d = dn{k{n), z), and let 

S = S{n,r) = SM„(d)(l,r) \ {vi, . . .,Vg}. 

By definition, S C {vg+i, . . . ,Vd} and S is an J>i((i, 2;)-measurable set. To use Proposi- 
tion 7.4, we need probability bounds on the lower tail of |5|. 

Fix any function f{n) = o{n) with f{n) > log^ n. By Proposition 3.2 we have 

Mnifin) A T 

in the local weak sense. Furthermore, by (3.3) and (3.4) we have P {/(n) < d} > 1 — 
0(n~^^), so for any x > 1, by Proposition 5.3 we have 

P{^M4rf)(l'0 < rVx} < P{i?M„(/{n))(l,0 < rVx} +0(n-99) 

<P{fiT(0,r) <rVa;}+o„(l) 

< Celogre-^^^'^' + On{l) . 

Next, recall the definition of the forward maximal process {{Xi, Zi),i > 1) and of the 
subtrees Pi of T from Section 2.1. Write i = i(T, z) = sup{j : Xj > z — 1}, and let T^^^ be 
the sub-RWG of T induced by the vertices in Pi, . . . , Pj, so T^^^ has X]j=i Wj\ — X]j=i 

vertices. By Proposition 3.2 and and (3.4), we have yin{g) T^^-*, so by Lemma 4.3, for 
any y > 1, 

<Cilog((z-l)-i)e-^^^'''+o„(l) 
< Ci\ogre-''^y"' + On{l) 
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r^/(21og^r) and ^ log re "=21/^/^ < CQlogre ^sa;^/**^ gQ 

P||5| < ;^^^| <2C6logre-'^«"'^' + o„(l) < ^ + o„(l), 



Taking x = log r and y = log r, for r sufficiently large we have r /x — yr /log r > 

Ci 
2 

^-^1 < 2C6logre-'=«-''' + o„(l) < \ 
2 log r J 

the last inequality holding for r sufficiently large. Write A = r'^0{z)'^ / (6 log^ r). For r large 
we have 9{z) ~ 2(log^'^ r)/r, so ^ ~ (log^'^'^^^ r)/6. It then follows from Proposition 7.4 
that for r large, 



19M 

For r large we have A/9{z)^ > r^/(50 log"'^'^^^ r), and also have and M/{z — 1) < r so 
UiesUnJ^''^^ C ^M„(l,2r). It follows that 

r3 ^ 19M 
50 log r J log r 

By Theorem 1.1, the latter bound implies that for all r sufficiently large 



P { iBuAh 2r)| > 22 > 1 - - 



, , , , ] 19M 

P <! \Bm{0, 2r) > ^ > 1 - 



50 log r J log ' r 

Now write rj = e* for i > 1. Then X^j>i log"'^'^ = ^j>x i"'^'^ < 00 and it follows by 
Borel-Cantelli that 

P <^ \BM{0,ri)\ < „^ for infinitely many i I = 0. 

t 400 log Tj J 

Finally, for r G (e\e*+i), if \Bu{0,r,)\ > ^5^3^ then \Bu{0,r)\ > ^55;^^, so 

P |limmf (^\BM{0,r)\ ■ ^^^^ > o| = 1 , 

proving the lower bound from Theorem 1.2. We now turn to the proof of Proposition 7.4, 
which is at the heart of the lower bound. 

7.3. A heuristic argument for Proposition 7.4. Fix e > with (1 + e)9{z) < 1, and 
let En^z be the event that 

d — q d 
t/(z) ■ n tl[z) ■ n 

By (3.3) and (3.4), this event occurs with probability 1 — 0{n~^^). We write E [•] as 
shorthand for the conditional expectation 

E [ • I J^n{d, z)]=E[-\ Fn{d, z)] 1^E„,A , 

and likewise write 

P {•} = P {•, En,z I Tn{d, Z)} = P {• I J-„(d, Z)} , 

each of the second equations holding since E^^z £ J'nid, z). 

We now work conditional on J-n{d, z). Using the notation a V 6 = max(a, b), for I < i < 
j < n — 1 write 

Wnihj) = z\w„{ej)>z] Vmax{W„(ei),i <l<j]. 
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Then for 1 < j < d, F {v £ Unj} is a measurable function of the forward maximal weights 
{w^{j,d),l < j < d}. Note that on En,z we have d < n, so Wn{ed) > z and thus 
w~^{j, d) > z for all I < j < d. Furthermore, 'w~^{j, d) > z for 1 < j < g. Also, w^{j, d) is 
decreasing in 1 < j < d, and for 1 < i < j < dwe, have 

^{V(^ Un,u} <a.s. P{?^ G f/nj} , 

with almost sure equality holding on the event that w^{i,d) = w^{j,d). In particular, 
since w:^{j, d) = z for g < j < d, for such j we have 

P{^; G Un,} >a.s. > + > (7-5) 

P{^ G C/n,,} <a.s. < • 

(7.6) 

For z close to 1, say z = 1 + 7 for 7 > small, ^(z) is near 27. Since, for fixed e, the 
probability P {-E^^} decays exponentially in n, it follows straightforwardly that for any 
fixed vertex w, 

E [\Un,,\ I w = vj,g <j<d] = {l + 0,^0(1))^:^ = . (7.7) 

Together with the weak convergence result Proposition 3.2, the results of Proposition 5.3 
and of Lemma 4.3 suggest that g is around 7"^. Similarly, Theorems 5.4 and 5.5 suggest 
that typically, a substantial fraction of the nodes in M„(g) have distance around I/7 from 
the root vi. If the trees {Un^w^w G V{Mn{g))} were typically of size I/7 (which is plausible 
in light of (7.7)), and additionally were typically of diameter 0(1/7), we would then obtain 
around \V{Mn{g))\/^ ~ 1/7^ nodes within distance I/7 from the root vi. 

There are two problems with this heuristic argument. First, (7.7) does not apply to 
nodes of Mn{g)- Indeed, if vj E V{Mn{g)) and w^{vj,d) = z' > z then the conditional 
expected size of Un,Vj is around \/{z' — 1), which may be much smaller than 7. We address 
this problem by instead considering a suitable collection of around 1/7^ nodes of M„(d) 
that are not in M„((7), but that were added in the early stages of Prim's algorithm (shortly 
after M„((7) was built) and that also have distance around I/7 from vi. 

Second, and more importantly, the (identically distributed) random variables I^Ai,!?^!; 
9 < j ^ d are not concentrated; their distribution is asymptotically that of | PGW(z*)|, 



where z* is the dual parameter to z anc 



(4.1) then says that for such j, E 



is near 1 — 7 for 7 = 2—1 small. In particular, 
is around 1/7^. The correct picture is not that 



the Un,vj are typically of size I/7. Rather, the typical size is 0(1), but an approximately 
7 proportion of the {Un,v ,9 < j 1^ d} have size 1/7^, and the latter typically have height 
of order I/7. To capture this picture, and thereby prove a volume growth lower bound, 
we study the first and second moments of the sizes of a carefully chosen family of subtrees 
of the trees {Un,Vj-,9 < j ^ d}. We now turn to details. 

7.4. The proof of Proposition 7.4. We continue to work conditional on J^nid,z). Fix 
a vertex u V{Mn{d)), and consider the following procedure, which we denote z-Prim{u). 
The short description of the procedure is this: start a tree-building exploration procedure 
from u. Use Prim's algorithm for edges of weight greater than z, and breadth-first search 
for edges of weight less than z; stop the first time a vertex of Mn{d) is added. For the 
sake of clarity, and to introduce some needed notation, we now explain the procedure more 
carefully. 

List the components of -fC^ that are disjoint from Mn{d) as C = C{z) = {Ci{z),i > 1). 
(There are only a finite number of such components, but we gloss this issue to avoid 
unnecessary notation.) Edges within the components of C have weight at most z, whereas 
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edges from these components to one another and to vertices in Mn{d) have weight greater 
than z. 

The vertex u lies in some component from C. Explore this component via breadth- first 
search (exploring the children of a given node in increasing order of label), and write Di{u) 
for the resulting breadth-first search tree. Next, for given i > 1, suppose that breadth- first 
search spanning trees Di{u), . . . , Di{u) of some set of components of K!^ have already been 
constructed. Add the smallest weight edge e from one of Di{u), . . . , Di{u) to the rest of the 
graph (this edge has weight greater than z). If the endpoint v of e not in Di{u), . . . , Di{u) 
is not a vertex of Mn{d), then let Di+i{u) be the breadth- first search spanning tree of the 
component from C containing v, and continue exploring. If v does lie within Mn{d), then 
write a{u) = v and stop. 

Write T{u) for the tree built by z-Prim{u), and write m{u) for the number of compo- 
nents oiK^ explored by 2:-Przm(M)before it stops; so, T{u) is composed of Z)i(u), . . . , L'^(„)(n), 
plus the single vertex a(u), which is the unique vertex belonging to both T{u) and to 
Mn{d). Note that if the components of spanned by Di{u), . . . , Dj^(^^-^{u) happen to be 
trees (we will shortly see that this occurs whp), then T(n) is precisely the restriction of 
Mn to V{T{u)). Note also that the trees {T{u),u V{Mn{d))) need not be disjoint; for 
example, if G V{T{u)) then T{u) shares at least the vertices v and a{v) = a{u) with 
Tiv). 

Let Tu = I + Ya^i^ |A('u)|, and let A„ = m{u) + Ylil^i^ 'iiam(Di{u)), so r„ = \T{u)\ 
and diam(7'(ti)) < A^. Writing 7 = z — 1 as before, we will next show that E [r^] and 
E[A u] are of orders 7 and 7 ^ , respectively. 

The argument of this paragraph is similar to the one appearing just after the statement 
of Proposition 3.2. Each time z-Prim{u) adds an edge not lying within a component of C, 
the vertex that is added is equally likely to be any vertex from an unexplored component 
of C or to be any vertex from {vi,g < i < d}. (It also may be a vertex of Mn{g), but 
this is less likely since, as noted earlier, Tn{d,z) provides "stronger lower bounds" on 
the weights of edges connecting Mn{g) with the rest of the graph.) Now fix £ > 1 and 
condition that m{u) > I, and let v be the first vertex added by 2:-Prim(n) after fully 
exploring Di{u), . . . , Di{u). Then 

P{ve V{Mn{d)) I m{u) >e,Diiu),..., D,{u)} > 

>(l-e)0(z)l[^„,^]. 

Since the right-hand side does not depend on L'i(u), . . . , Di{u), by averaging we thus have 

P{v£ V{Mn{d)) I m{u) > > (1 - e)0(z)l[s„,,]. (7.8) 

Given that m{u) > £ (i.e., that v y(M„(d))), the graph K^[{1, . • • ,n} \ {V{M„{d)) U 
[jl^i Di{u))] is stochastically dominated by K'^_^^_gy and u is a uniformly random vertex 
of this graph. Assuming z < 3/2, say, on the event -En, 2, uniformly in e > sufficiently 
small we have 

(n - (d - g))il - e-^h < n(l - e{z){l - ,)) (i±Ml))f 

n 

< {l + On{l))z{l-e{z) + e) 

<z* + 2e, (7.9) 

the final inequality holding for n sufficiently large. It follows from standard results about 
subcritical random graphs (see, e.g., [10] Corollary 5.24) that for n sufficiently large, 

P{Di+i{u) is a tree | miu) > 1} > (1 - n-^/^)l[E„^^]. (7.10) 
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Also, using (7.9), it is straightforward to check that for e > sufficiently small and n 
sufficiently large, on En^z we have Bin(n — (d — g), (1 — e~^/")) ^st Poisson(z* + 3e), 
where ^st denotes stochastic domination.^ By considering the breadth-first construction 
of D£^i{u), it follows that given that m{u) > I and that D^j^i{u) is a tree, -D£+i(n) is 
stochastically dominated by PGW(2;* + 2>e)J 

Next, (7.8) implies that P{m(n) > l \ m{u) > ^} < (1 - (1 - e)6'(2;))l[£;„^], and so on 
En,z, m{u) is stochastically dominated by a Geometric((l — e)9{z)) random variable. It 
follows that (still conditional on Tn{d,z)), on the event E^^z we have 

G 

Tu ^st ^ \Pi\-, 
1=1 

where G is Geometric((l— e)0(z)) and, independently of G, the (Pi, i > 1) are iid PGW(2;*-|- 
3e). By Wald's identity and (4.1), we then obtain that 

for e > small enough. (Here we are still writing 7 = z — 1, and use that 9{z) ~ 2(z — 1) 
and that z — 1 ~ 1 — z* , both as z ^ 1, in the last inequality.) Also, by (7.10) and the 
stochastic bound on m(u), we easily obtain that 



P {The components of spanned by Di{u), . . . , !)„(„)(«) are all trees} 

/ e{z)-logn \ 

^[^- „l/2 ) ■ MEn,.] ■ 

We may bound E [A^] in a similar fashion to E [r^] ; for this we use that for e sufficiently 
small, z* +3e < 1. The diameter of a PGW(l) tree has finite expectation (this is standard, 
but also follows easily from (4.2) and Theorem 5.1), and we obtain that on En^z, the random 
variable A^j is dominated by the sum of a Geometric((l — e)9(z)) number of iid random 
variables with some finite expectation F. It follows that 

F F 
E [An] < • -^^3^)^ < hEn,.] • - ' (7-12) 

for e > sufficiently small and z sufficiently close to 1. 

By reprising the above argument, we can obtain a stochastic lower bound on t„ that is of 
roughly the same form. First, given ^ < t„, by the tree built by the first i steps of z-Prim(u), 
we mean the subtree of T{u) consisting of the first i vertices added by z-Prim{u). The 
"order of addition of vertices" is well-defined since we explore the subtrees Di, . . . , -Dm{«) 
in order, and within each subtree the vertices are explored in breadth-first search order. 
More precisely, we may think of each step of z-Prim{u) as consisting of the breadth- first- 
search exploration of a single node, plus possibly the connection of a new component from 
C (the latter occurring each time the current BFS exploration concludes). 

Let T~ = min(rj, [enj). The tree built by the first t~ steps of z-Prim{u) is a subtree of 
T{u), and is equal to T{u) precisely if Tu < [enJ . For £ < t~ , at step £ of z-Prim{u), some 
tree Di is partially built. The number of new nodes added to Di in the BFS exploration 
at step i has distribution Bin(n — d — i,l — e"^/"). For e > small, on En^z we have 



For any fixed 2; > and e > 0, for all n sufficiently large, Bin(n, x/n) Poisson(a; + e). 
'''To carefully verify this, one should use that conditional on the vertex set of Di+i{u), the event that 
Di+i{u) is a tree is a decreasing event; as this is rather standard and distracts from the flow of the 
argument, we omit the details. 
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n — d — i > n(l — 26(z)). Since also 1 — e > (1 — e)z/n for n large, on En^z the number 
of new nodes thus stochastically dominates 

Bin ^n(l - 2d{z)), ~ ^^^ ^ ^st Poisson(z(l - 36'(z))) , (7.13) 

the last stochastic inequality again holding for n large. Furthermore, for fixed i > 1, on 
the event that m{u) > i we have 



P < 



m{u) > i 



m{u 



n — (1 + e)9{z)n — en 



n 



> 



l[i^„..](l-2^(^)) 



Together, (7.13) and (7.14) imply that on En 



G- 



T„ hst min([enJ,^|Pi |) 



(7.14) 



(7.15) 



i=l 



where G is Geometric(l— 2^(2;)) and, independent of G , the {P- ,i > 1) are iid PGW(1- 
29iz)). 

For 7 = z — 1 sufficiently small, 37 < 26{z) < 47, so 



1 



1>1 



5O7 J 10 



Also, by (4.1) 



E[l^ri]>^> E[|prH<^, 

so by a routine application of the Paley-Zygmund inequality. 



([1/5071 



> 



> 



1=1 

By the independence of G~ and the P[' , it follows that 



100 



> 



i=l 



4007^ 



> 



1000 



We also have P l-^i I ^ L^^-J | — ^ as n — )• 00, and so by the stochastic relation 

(7.15) we obtain 

1 



4OO72 



> Ir 



9 \ 1 

1000 ~ ') - '^"'"1 ' 125 



for n sufficiently large. Since r„ < Tu almost surely, combined with (7.11) and (7.12) this 
implies that there is C > 1 such that 



1 c 



C72'72 



1 

c 



this bound holds uniformly over all z with z — 1 sufficiently small, over all e with < e < 
eo{z), and for all n greater than some fixed tiq = nQ{z,e). 
Fix ve e V{Mnid)) with g < I < d, and let 



1 C_ 



c 
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For any u V{Mn{d)), a{u) £ {vi, . . . , Vd}- By (7.5) and by symmetry, we have 

1 C" 



P <^ a{u) =ve\Tu£ 



CI 1 

Xu<-\>1[E„. 



7 



and we similarly have F {a{u) = ve} < < - e)6'(n)). On £'„,^, for 

e > sufficiently small, n — d> n(l — 29{z)) and d < {1 + e)9{z), so 

- , , n-d 1 - 2e{z) 

E \NA >1^E 1 r- > Ifs 1 • 7 ■ 7.16 

Now fix an Fn{d, z)-measurable set S C {vg+i, . . . , f^}, and write A^^ := X^j.^.g^ A'j. Note 

that A'j counts a subset of the vertices in U^{'^ = U^{^^ ^\ so to prove Proposition 7.4 it 
suffices to prove that for any A > 1, 

^[^s>^^>(l'{\S\>§^]- o„(l)) (l - ^ - «(.)) . (7,17) 

To prove such a lower bound, we shall use the second moment method; for this we require 
an upper bound on expectations of the form E \NjNi\, for g < j < I < d, and we now turn 
to proving such a bound. 

The principal contribution comes from the case j = /; in this case we seek an upper 
bound on 

u,u'^V{M^{d)) 

We bound the last probability above by considering whether or not u' lies within T{u). 
\i u £ Nf^ then |T(u)| < by the symmetry of the set {w : w ^ V{Mn{d))}, we thus 

have 

Pj-u G Ni,u' G V{T{u))] = P{n' G V{T{u))\u G iV^} G N^} 

(1 — e)0[z)n 

(J 

^1[^".^](^(1_,)^(,)„)2- (7.18) 

We next bound P{n G Ni,u' G Ni,u' V{Tiu))}. li u £ Ng and u' V{T{u)), then in 
order to have u' G A'^, the z-Prim{u') procedure must at some point add a vertex of T{u). 
To bound the latter probability, consider a modification of z-Prim{u') which stops the first 
time either a vertex of Mn{d) or a vertex of T{u) is added. Write P{u') for the last vertex 
added by the modified procedure; then 

P{m G Ni,u' G N^,u' V{T{u))] = P {n G iV^u' V {T{u)), I3{u') G V{T{u))] . 

We have already conditioned on Mn{d) (more precisely, on Tn{d, z)); we now additionally 
condition on T{u). All edges from T{u) to the rest of the graph have weight at least z, 
and it follows by arguing as at (7.5) that 

V {u' ^V{T{u)),P{u') £V{T{u)),u£ N, I T{u)] 

\nu)\ 



< 1 



[«'^y(r(«))P[«eAf,]-L[i?„,,] • 



d-g + \T{u) 



When u £ Ni we have |7'(u)| < and on En^z we have d > (1 — e)6{z) ■ n. It follows 

that 

P {u' V{T{u)),^iu') £ ViT{u)),u£ Ne \ r(n)} < l[,eJV,] • • (1 _ g)V(^) . ^ • 
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The only term on the right that depends on T{u) is l[u^]\j^y, by averaging over T{u) we 
thus obtain 



F{uGN,,u' eNe,u' ^ViTiu))} <P{ueNe}- (7.19) 
^ 7 (1 ~ e)t/(z)n 

C 



Combined with (7.18) we thus obtain the bound 

2C_ 

(7(1 -^{z)nf 

and summing over pairs u, u' y(M„((i)) (there are less than -n? such pairs) yields 

2(7 

In the case j ^ £, the same style of argument works with minor modifications, which we 
only briefly sketch. In order to have u G Nj and u' € A^^ we can not have u' G V{T{u)), 
and must have a{u') = vg. The symmetry of the nodes {vi,g < i < d} is not broken by 
the knowledge that u G Nj, so we have 

P {a{u') = ve\uG N,} < • _ ; 

we thus obtain the bound 

1 



P {n E AT,, n'G AT,} 
and so 

Combining the preceding inequality with (7.21), it follows that for any J-'n{d, z)-measurable 
set S C {vi, g <i <d}, writing Ns = Ni, we have 

By (7.16), we also have 

E[iV5]>l[^„,,] (l + e)^(z) • 

By the preceding inequalities and the conditional Chebyshev inequality ([13], p. 194), for 
any t > we have 

\Ns - E [A^5] I > t} < ^ (e [nI] - E [Nsf 

^ 1 \S\' fl + 2C/{^'\S\) {l-2e{z)f 



Recall that for all z > 1 we have 9{z) < 2{z — 1) = 27. It follows that for z — \ sufficiently 
small, for any D > 1, for n large enough we have 

2' 



30(z)" ' - ^(z)2 J - y " ' - 29{z)'' ' - e{z) 



> H\s\>DC/e{z)^] ■ 1[E„,,] • i 1 - i ^1-^)2 {l-ey 
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By the tower law, we thus obtain 
DC 



3e{z) 



>P{Ns>3^,\S\>^.E, 



3e{z) 



E 



\S\ 



\S\> 



DC 



>P{\S\>DC/e{z)^,En^,}-(l- 



1 + 8/D {l-2e{z)f 



fl 



Since P {En,z} — ?• 1 as n — t- oo, and since e > was arbitrary, it follows that for any 
J>i^rf-measurable S C {vj,g < j < d}, 



P iVs > 



DC 
3^ 



> P l^l> 



DM 
9{zf 



On(l) 



i---ieiz] 



Taking A = D/3 then establishes (7.17) and so completes the proof of Proposition 7.4. 



List of Notation 

a{v) For v G V{T), a{v) = 1; for v V{T), a{v) is the weight of the largest weight 



edge on the path from w to T 5 

Bq{v,x) Ball in G for the distance 2 

Bq{v,x) Ball in G for the distance dQ 2 

Bx Truncated, size-biased Borel- Tanner random variable 7 

Components of K^, listed in decreasing order of size as (C^ . . . , ^). . 11 

dQ Weighted graph distance in G 2 

dc Unweighted graph distance in G 2 

dnij, z) Equals inf{^ ■ j <(.<n-l, W„(e<?) > z] 20 

ei(G) The z'th edge added by Prim's algorithm on G 2 

Fniji z) fj-algebra induced by {M„^i, 1 <i < j} and by the indicator l[^^(ej-)>z]- • • 20 
Fn'^ Subgraph of M„ with edges {e G E{Mn) : e G E{Mn4„(z)) or W„(e) < A}. 20 

9nij, z) Last step before step j that Prim's algorithm on K„ adds an edge of weight > 

z 11 

I{z) Index of first forward maximal weight Xi with Xj < z 14 

K„ The complete graph on [n] with root 1 and with Exponential(n — 1) edge 

weights 3 

Subgraph of K„ with edges E{Kl) = {e G E{Kn) ■Wn{e)<z] 11 

M Component of the wired MSF on U containing the root 0; rM = (M, 0, VF).3 

M(A) Subtree of M with vertex set {v G V{M) : a{v) < A} 5 

Mn The minimum spanning tree of K„; M„ = {Mn, 1, Wn) 3 

Mn^k Subtree of M„ built by first k steps of invasion percolation 10 

z X z X z X 

Mn' Subtree of Fn' induced by the set of vertices whose path to vi in F„' does 

not pass through {vj,gn{z) < j < dn{z)} 20 

M^'^ Subtree of M(A) induced by nodes whose path to vi in M(A) does not pass 

through {vj,j > g{z)} 20 

p{v) The parent of node v 7 

PGW(A) Poisson(A) Galton- Watson tree 6 

{Pi,i > 1) The components of T when forward maximal edges are removed 6 

RWG Rooted weighted graph 2 

eiX) P{PGW(A) = 00} 6 

Tfc Subtree of T built by first k steps of invasion percolation 10 

The invasion percolation cluster of {U, u, W); T = T^^^ 3 

U The Ulam-Harris Tree 3 
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([/, 0, W) 


ine roisson-weignted innnite tree 


. 3 




The i'th vertex added by Prim's algorithm on G 


2 


x{v) 


Weight of largest weight edge in the unique infinite path in M leaving v. 


..7 


Xl 


Unique value for which 9'{xi) = 1 


14 


{{Xi,Zi),i 


> 1) The forward maximal process 


7 
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